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HUMAN MEMBRANE-ASSOCIATED PROTEINS 

TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of human membrane-associated 
proteins and to the use of these sequences in the diagnosis, treatment, and prevention of cell signaling, 
cell differentiation, and cell proliferation disorders. 

BACKGROUND OF THE INVENTION 

Eukaryotic cells are surrounded by plasma membranes which enclose the cell and maintain an 
environment inside the cell that is distinct from its surroundings. In addition, eukaryotic organisms are 
distinct from prokaryotes in that they possess many intracellular organelle and vesicle structures. Many 
of the metabolic reactions which distinguish eukaryotic biochemistry from prokaryotic biochemistry 
take place within these structures. The plasma membrane and the membranes surrounding organelles 
and vesicles are composed of phosphoglycerides, fatty acids, cholesterol, phospholipids, glycolipids, 
proteoglycans, and proteins. These components confer identity arid functionality to the membranes with 
which they associate. 
Integral Membrane Proteins 

The majority of known integral membrane proteins are transmembrane proteins (TM) which 
are characterized by an extracellular, a transmembrane, and an intracellular domain. TM domains are 
typically comprised of 15 to 25 hydrophobic amino acids which are predicted to adopt an a -helical 
conformation. TM proteins are classified as bitopic (Types I and II) and polytopic (Types III and IV) 
(Singer, S.J. (1990) Annu. Rev. Cell Biol. 6:247-96). Bitopic proteins span the membrane once while 
polytopic proteins contain multiple membrane-spanning segments. TM proteins function as cell- 
surface receptors, receptor-interacting proteins, transporters of ions or metabolites, ion channels, cell 
anchoring proteins, and cell type-specific surface antigens. 

Many membrane proteins (MPs) contain amino acid sequence motifs that target these proteins 
to specific subcellular sites. Examples of these motifs include PDZ domains, KDEL, RGD, NGR, 
and GSL sequence motifs, von Willebrand factor A (vWFA) domains, and EGF-like domains. RGD, 
NGR, and GSL motif-containing peptides have been used as drug delivery agents in cancer treatments 
which target tumor vasculature (Arap, W. et al. (1998) Science, 279:377-380). Furthermore, MPs 
may also contain amino acid sequence motifs, such as the carbohydrate recognition domain (CRD), 
that mediate interactions with extracellular or intracellular molecules. 

In some cases TM proteins function as mediators of cell-cell attachment. For example, Emp 
(erythrocyte macrophage protein), which mediates attachment of erythroblasts to macrophages, has a 
putative TM domain near its N-terminus. It is postulated that Emp suppresses the process of 
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apoptosis by promoting terminal differentiation of erythroid cells when Emp-mediated contact is 
made between erythroblasts and macrophages (Hanspal, M. et al. (1998) Blood 92:2940-2950). 

One function of TM proteins is to facilitate cell-cell communication. Neurexins are a family 
of neuronal cell surface receptor proteins, with single TM regions, that aid in axon guidance and 
5 synaptogenesis. Neurexins exhibit extensive alternative splicing that produces hundreds of unique 
neurexins in the brain (Ushkaryov, Y.A. et. al. (1992) Science 257:50-56). 

In some cases TM proteins serve as transporters or channels in the cell membrane. The Rh 
(Rhesus) erythrocyte blood group protein family serve such a function in erythrocyte membranes. 
The family includes both Rh50 glycoprotein and Rh30 polypeptides that together form a complex 

10 essential for Rh antigen expression and erythrocyte membrane integrity. Mutations in Rh50 underlie 
Rh deficiency syndrome, an autosomal recessive disorder associated with chronic hemolytic anemia 
and spherostomatocytosis (Huang, C.H. (1998) J. Biol. Chem. 273:2207-2213). 

Tumor antigens are cell surface molecules that are differentially expressed in tumor cells 
relative to normal cells. Tumor antigens distinguish tumor cells immunologically from normal cells 

15 and provide diagnostic and therapeutic targets for human cancers. (Takagi, S. et al. (1995) Int. J. 
Cancer 61: 706-715; Liu, E. et al. (1992) Oncogene 7: 1027-1032.) 

Other types of cell surface antigens include those identified on leukocytic cells of the immune 
system. These antigens have been identified using systematic, monoclonal antibody (mAb)-based 
"shot gun" techniques. These techniques have resulted in the production of hundreds of mAbs 

20 directed against unknown cell surface leukocytic antigens. These antigens have been grouped into 
"clusters of differentiation" based on common immunocytochemical localization patterns in various 
differentiated and undifferentiated leukocytic cell types. Antigens in a given cluster are presumed to 
identify a single cell surface protein and are assigned a "CD" or "cluster of differentiation" 
designation. Some of the genes encoding proteins identified by CD antigens have been cloned and 

25 verified by standard molecular biology techniques. CD antigens have been characterized as both 
transmembrane proteins and cell surface proteins anchored to the plasma membrane via covalent 
attachment to fatty acid-containing glycolipids such as glycosylphosphatidylinositol (GPI), discussed 
below. (Reviewed in Barclay, A. N. et al. (1995) The Leucocyte Antigen Facts Book . Academic Press, 
San Diego, CA, pp. 17-20.) 

30 The TM cell surface glycoprotein CD69 is an early activation antigen of T lymphocytes. 

CD69 is homologous to members of a supergene family of type II integral membrane proteins having 
C-type lectin domains. Although the precise functions of the CD-69 antigen is not known, evidence 
suggests that these proteins transmit mitogenic signals across the plasma membrane and are up- 
regulated in response to lymphocyte activation (Hammann, J. et. al. (1993) J. Immunol. 150:4920- 

35 4927). 



WO 00/65054 



PCT/USOO/10884 



Peripheral and Anchored Membrane Proteins 

Some membrane proteins are not membrane-spanning but are attached to the plasma 
membrane via membrane anchors or interactions with integral membrane proteins. Membrane 
anchors are covalently joined to a protein post-translationally and include such moieties as prenyl, 
myristyl, and glycosylphosphatidyl inositol (GPI) groups. Membrane localization of peripheral and 
anchored proteins is important for their function in processes such as receptor-mediated signal 
transduction. For example, prenylation of Ras is required for its localization to the plasma membrane 
and for its normal and oncogenic functions in signal transduction. 

The Ly-6 antigens comprise a family of membrane- bound proteins expressed primarily on the 
surface of T-lymphocytes and, to a lesser extent, on the surface of other leukocytes and leukocyte 
precursors. (Reviewed in Barclay, A. N. et al. (1995) The Leucocyte Antigen Facts Book . Academic 
Press, San Diego, CA, pp. 352-354; Friedman, S. et al. (1990) Immunogenetics 31 :104-1 IT) Ly-6 
antigens are attached to the cell surface via GPI anchors. Ly-6 antigens are each about 1 35 amino 
acids in length and about 50% identical to one another. Although the precise functions of the Ly-6 
antigens are not known, evidence suggests that these proteins transmit mitogenic signals across the 
plasma membrane and are up-regulated in response to lymphocyte activation. 

An example of a peripheral membrane protein is the mucin glycoprotein ASGP-1 (ascites 
sialoglycoprotein-1) from cell line 13762 rat mammary adenocarcinoma. ASGP-1 binds to a 
transmembrane-spanning partner to form a heterodimeric complex on the cell surface. It is proposed 
that this complex affects the process of tumor progression by modulating growth factor activity (Wu, 
KL et al. (1994) J. Biol. Chem. 269:1 1950-1 1955). 

The discovery of new human membrane-associated proteins and the polynucleotides encoding 
them satisfies a need in the art by providing new compositions which are useful in the diagnosis, 
prevention, and treatment of cell signaling, cell differentiation, and cell proliferation disorders. 

SUMMARY OF THE INVENTION 

The invention features purified polypeptides, human membrane-associated proteins, referred to 
collectively as "HUMAP" and individually as "HUMAP-1," "HUMAP-2," "HUMAP-3," "HUMAP- 
4," "HUMAP-5," "HUMAP-6," "HUMAP-7," "HUMAP-8," "HUMAP-9," "HUMAP-10," 
"HUMAP-11," "HUMAP-12," "HUMAP- 1 3," "HUMAP- 14," "HUMAP-15," "HUMAP-16," and 
"HUMAP-17." In one aspect, the invention provides an isolated polypeptide comprising a) an amino 
acid sequence selected from the group consisting of SEQ ID NO:l~17, b) a naturally occurring amino 
acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-17, c) a biologically active fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-1 7, or d) an immunogenic fragment of an amino acid 
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sequence selected Irom the group consisting of SEQ ID NO: 1-17. In one altcrnati ve. the invention 
provides an isolated polypeptide comprising the amino acid sequence of SEQ ID NO: 1-1 7. 

The invention further provides an isolated polynucleotide encoding a polypeptide comprising a) 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1 - 1 7, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-17, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-1 7, or d) an immunogenic fragment 
of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-1 7. In one alternative, 
the polynucleotide is selected from the group consisting of SEQ ID NO: 18-34. 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide comprising a) an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-17, b) a naturally occurring amino acid 
sequence having at least 90% sequence identity to an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-1 7, c) a biologically active fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO:l-17, or d) an immunogenic fragment of an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-17. In one alternative, the invention 
provides a cell transformed with the recombinant polynucleotide. In another alternative, the invention 
provides a transgenic organism comprising the recombinant polynucleotide. 

The invention also provides a method for producing a polypeptide comprising a) an amino acid 
sequence selected from the group consisting of SEQ ID NO:l-17, b) a naturally occurring amino acid 
sequence having at least 90% sequence identity to an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-17, c) a biologically active fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO:l-17, or d) an immunogenic fragment of an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-17. The method comprises a) culturing a 
cell under conditions suitable for expression of the polypeptide, wherein said cell is transformed with a 
recombinant polynucleotide comprising a promoter sequence operably linked to a polynucleotide 
encoding the polypeptide, and b) recovering the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 
polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID NO:l- 
17, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an amino 
acid sequence selected from the group consisting of SEQ ID NO:l-17, c) a biologically active fragment 
of an amino acid sequence selected from the group consisting of SEQ ID NO: 1 - 1 7, or d) an 
immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:l- 
17. 
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The invention further provides an isolated polynucleotide comprising a) a polynucleotide 
sequence selected from the group consisting of SEQ ID NO: 18-34, b) a naturally occurring 
polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected 
from the group consisting of SEQ ID NO: 18-34, c) a polynucleotide sequence complementary to a), or 
5 d) a polynucleotide sequence complementary to b). In one alternative, the polynucleotide comprises at 
least 60 contiguous nucleotides. 

Additionally, the invention provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide having a sequence of a polynucleotide comprising a) a polynucleotide 
sequence selected from the group consisting of SEQ ID NO: 18-34, b) a naturally occurring 

10 polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected 
from the group consisting of SEQ ID NO: 18-34, c) a polynucleotide sequence complementary to a), or 
d) a polynucleotide sequence complementary to b). The method comprises a) hybridizing the sample 
with a probe comprising at least 16 contiguous nucleotides comprising a sequence complementary to 
said target polynucleotide in the sample, and which probe specifically hybridizes to said target 

15 polynucleotide, under conditions whereby a hybridization complex is formed between said probe and 
said target polynucleotide, and b) detecting the presence or absence of said hybridization complex, and 
optionally, if present, the amount thereof. In one alternative, the probe comprises at least 30 contiguous 
nucleotides. In another alternative, the probe comprises at least 60 contiguous nucleotides. 

The invention further provides a pharmaceutical composition comprising an effective amount 

20 of a polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-1 7, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-17, c) a biologically active 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-1 7, or d) an 
immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:l- 

25 17, and a pharmaceutical^ acceptable excipient The invention additionally provides a method of 

treating a disease or condition associated with decreased expression of functional HUMAP, comprising 
administering to a patient in need of such treatment the pharmaceutical composition. 

The invention also provides a method for screening a compound for effectiveness as an 
agonist of a polypeptide comprising a) an amino acid sequence selected from the group consisting of 

30 SEQ ID NO: 1-17, b) a naturally occurring amino acid sequence having at least 90% sequence identity 
to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-17, c) a biologically 
active fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-1 7, or 
d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID 
NO:l-17. The method comprises a) exposing a sample comprising the polypeptide to a compound, 
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and b) detecting agonist activity in the sample. In one alternative, the invention provides a 
pharmaceutical composition comprising an agonist compound identified by the method and a 
pharmaceutical^ acceptable excipient. In another alternative, the invention provides a method of 
treating a disease or condition associated with decreased expression of functional HUMAP, 
5 comprising administering to a patient in need of such treatment the pharmaceutical composition. 

Additionally, the invention provides a method for screening a compound for effectiveness as 
an antagonist of a polypeptide comprising a) an amino acid sequence selected from the group consisting 
of SEQ ID NO: 1-17, b) a naturally occurring amino acid sequence having at least 90% sequence 
identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-17, c) a 

10 biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-1 7, or d) an immunogenic fragment of an amino acid sequence selected from the group consisting 
of SEQ ID NO: 1-1 7. The method comprises a) exposing a sample comprising the polypeptide to a 
compound, and b) detecting antagonist activity in the sample. In one alternative, the invention 
provides a pharmaceutical composition comprising an antagonist compound identified by the method 

15 and a pharmaceutically acceptable excipient. In another alternative, the invention provides a method 
of treating a disease or condition associated with overexpression of functional HUMAP, comprising 
administering to a patient in need of such treatment the pharmaceutical composition. 

The invention further provides a method for screening a compound for effectiveness in 
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 

20 sequence selected from the group consisting of SEQ ID NO: 18-34, the method comprising a) 
exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered 
expression of the target polynucleotide. 



BRIEF DESCRIPTION OF THE TABLES 

25 Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ ID NOs), 

clone identiOcation numbers (clone IDs), cDNA libraries, and cDNA fragments used to assemble full- 
length sequences encoding HUMAP. 

Table 2 shows features of each polypeptide sequence, including potential motifs, homologous 
sequences, and methods, algorithms, and searchable databases used for analysis of HUMAP. 

30 Table 3 shows selected fragments of each nucleic acid sequence; the tissue-specific expression 

patterns of each nucleic acid sequence as determined by northern analysis; diseases, disorders, or 
conditions associated with these tissues; and the vector into which each cDNA was cloned. 

Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones 
encoding HUMAP were isolated. 

35 Table 5 shows the tools, programs, and algorithms used to analyze HUMAP, along with 
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applicable descriptions, references, and threshold parameters. 



DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is understood 
5 that this invention is not limited to the particular machines, materials and methods described, as these 
may vary. It is also to be understood that the terminology used herein is for the purpose of describing 
particular embodiments only, and is not intended to limit the scope of the present invention which will 
be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms "a," "an," 
10 and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a 
reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a 
reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so 
forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same meanings 
15 as commonly understood by one of ordinary skill in the art to which this invention belongs. Although 
any machines, materials, and methods similar or equivalent to those described herein can be used to 
practice or test the present invention, the preferred machines, materials and methods are now described. 
All publications mentioned herein are cited for the purpose of describing and disclosing the cell lines, 
protocols, reagents and vectors which are reported in the publications and which might be used in 
20 connection with the invention. Nothing herein is to be construed as an admission that the invention is 
not entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

"HUMAP" refers to the amino acid sequences of substantially purified HUMAP obtained from 
any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 

25 human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
HUMAP. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of HUMAP either by directly interacting with 
HUMAP or by acting on components of the biological pathway in which HUMAP participates. 

30 An "allelic variant" is an alternative form of the gene encoding HUMAP. Allelic variants may 

result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
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Each of these types of chances may occur alone, or in combination with the others, one or more times in 
a given sequence. 

"Altered" nucleic acid sequences encoding HUMAP include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as HUMAP or a 
5 polypeptide with at least one functional characteristic of HUMAP. Included within this definition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of 
the polynucleotide encoding HUMAP, and improper or unexpected hybridization to allelic variants, 
with a locus other than the normal chromosomal locus for the polynucleotide sequence encoding 
HUMAP. The encoded protein may also be "altered," and may contain deletions, insertions, or 

10 substitutions of amino acid residues which produce a silent change and result in a functionally 

equivalent HUMAP. Deliberate amino acid substitutions may be made on the basis of similarity in 
polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the 
residues, as long as the biological or immunological activity of HUMAP is retained. For example, 
negatively charged amino acids may include aspartic acid and glutamic acid, and positively charged 

15 amino acids may include lysine and arginine. Amino acids with uncharged polar side chains having 
similar hydrophilicity values may include: asparagine and glutamine; and serine and threonine. 
Amino acids with uncharged side chains having similar hydrophilicity values may include: leucine, 
isoleucine, and valine; glycine and alanine; and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 

20 polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 
molecules. Where "amino acid sequence" is recited to refer to an amino acid sequence of a naturally 
occurring protein molecule, "amino acid sequence** and like terms are not meant to limit the amino acid 
sequence to the complete native amino acid sequence associated with the recited protein molecule. 
"Amplification" relates to the production of additional copies of a nucleic acid sequence. 

25 Amplification is generally carried out using polymerase chain reaction (PCR) technologies well known 
in the art. 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity of 
HUMAP. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small 
molecules, or any other compound or composition which modulates the activity of HUMAP either by 
30 directly interacting with HUMAP or by acting on components of the biological pathway in which 
HUMAP participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments thereof, 
such as Fab, F(ab') 2 , and Fv fragments, which are capable of binding an epitopic determinant. 
Antibodies that bind HUMAP polypeptides can be prepared using intact polypeptides or using 
35 fragments containing small peptides of interest as the immunizing antigen. The polypeptide or 
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oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the 
translation of RN A, or synthesized chemically, and can be conjugated to a carrier protein if desired. 
Commonly used carriers that are chemically coupled to peptides include bovine serum albumin, 
thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize 
5 the animal. 

The term "antigenic determinant'" refers to that region of a molecule (i.e., an epitope) that 
makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies which 
bind specifically to antigenic determinants (particular regions or three-dimensional structures on the 

10 protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen used to 
elicit the immune response) for binding to an antibody. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" strand 
of a specific nucleic acid sequence. Antisense compositions may include DNA; RN A; peptide nucleic 
acid (PNA); oligonucleotides having modified backbone linkages such as phosphorothioates, 

15 methylphosphonates, or benzylphosphonates; oligonucleotides having modified sugar groups such as 2'- 
methoxyethyl sugars or 2'-methoxyethoxy sugars; or oligonucleotides having modified bases such as 5- 
methyl cytosine, 2'-deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense molecules may be produced 
by any method including chemical synthesis or transcription. Once introduced into a cell, the 
complementary antisense molecule base-pairs with a naturally occurring nucleic acid sequence 

20 produced by the cell to form duplexes which block either transcription or translation. The designation 
"negative** or "minus" can refer to the antisense strand, and the designation "positive" or "plus" can 
refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical r 
functions of a naturally occurring molecule. Likewise, "immunologically active" refers to the capability 

25 of the natural, recombinant, or synthetic HUMAP, or of any oligopeptide thereof, to induce a specific 
immune response in appropriate animals or cells and to bind with specific antibodies. 

The terms "complementary" and "complementarity" refer to the natural binding of 
polynucleotides by base pairing. For example, the sequence "5* A-G-T 3'" bonds to the complementary 
sequence "3* T-C-A 5\" Complementarity between two single-stranded molecules may be "partial," 

30 such that only some of the nucleic acids bind, or it may be "complete," such that total complementarity 
exists between the single stranded molecules. The degree of complementarity between nucleic acid 
strands has significant effects on the efficiency and strength of the hybridization between the nucleic 
acid strands. This is of particular importance in amplification reactions, which depend upon binding 
between nucleic acid strands, and in the design and use of peptide nucleic acid (PNA) molecules. 
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A ' composition comprising a given polynucleotide sequence" and a ' composition comprising a 
given amino acid sequence** refer broadly to any composition containing the given polynucleotide or 
amino acid sequence. The composition may comprise a dry formulation or an aqueous solution. 
Compositions comprising polynucleotide sequences encoding HUM AP or fragments of HUMAP may 
be employed as hybridization probes. The probes may be stored in freeze-dried form and may be 
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 
deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl sulfate; 
SDS), and other components (e.g., Denhardt's solution, dry milk^ salmon sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been resequenced to resolve 
uncalled bases, extended using the XL-PCR kit (Perkin-Elmer, Norwalk CT) in the 5 1 and/or the 3' 
direction, and resequenced, or which has been assembled from the overlapping sequences of one or 
more Incyte Clones and, in some cases, one or more public domain ESTs, using a computer program 
for fragment assembly, such as the GEL VIEW fragment assembly system (GCG. Madison WI). Some 
sequences have been both extended and assembled to produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that, when made, least interfere 
with the properties of the original protein, i.e., the structure and especially the function of the protein is 
conserved and not significantly changed by such substitutions. The table below shows amino acids 
which may be substituted for an original amino acid in a protein and which are regarded as conservative 
amino acid substitutions. 



Original Residue 


Conservative Substitution 


Ala 


Gly, Ser 


Arg 


His, Lys 


Asn 


Asp, Gin, His 


Asp 


Asn, Glu 


Cys 


Ala, Ser 


Gin 


Asn, Glu, His 


Glu 


Asp, Gin, His 


Gly 


Ala 


His 


Asn, Arg, Gin, Glu 


He 


Leu, Val 


Leu 


He, Val 


Lys 


Arg, Gin, Glu 


Met 


Leu, He 


Phe 


His, Met, Leu, Trp, Tyr 


Ser 


Cys, Thr 


Thr 


Ser, Val 


Trp 


Phe, Tyr 


Tyr 


His, Phe, Trp 


Val 


He, Leu, Thr 



Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 
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backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of the 
side chain. 

A "deletion"' refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

The term "derivative*' refers to the chemical modification of a polypeptide sequence, or a 
polynucleotide sequence. Chemical modifications of a polynucleotide sequence can include, for 
example, replacement of hydrogen by an alkyl, acyl, hydroxyl, or amino group. A derivative 
polynucleotide encodes a polypeptide which retains at least one biological or immunological function of 
the natural molecule. A derivative polypeptide is one modified by glycosylation, pegylation, or any 
similar process that retains at least one biological or immunological function of the polypeptide from 
which it was derived. 

A "fragment" is a unique portion of HUMAP or the polynucleotide encoding HUM AP which 
is identical in sequence to but shorter in length than the parent sequence. A fragment may comprise 
up to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For 
example, a fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A 
fragment used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 
5, 10, 15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or 
amino acid residues in length. Fragments may be preferentially selected from certain regions of a 
molecule. For example, a polypeptide fragment may comprise a certain length of contiguous amino 
acids selected from the first 250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown 
in a certain defined sequence. Clearly these lengths are exemplary, and any length that is supported 
by the specification, including the Sequence Listing, tables, and figures, may be encompassed by the 
present embodiments. 

A fragment of SEQ ID NO: 1 8-34 comprises a region of unique polynucleotide sequence that 
specifically identifies SEQ ID NO: 18-34, for example, as distinct from any other sequence in the 
same genome. A fragment of SEQ ID NO: 18-34 is useful, for example, in hybridization and 
amplification technologies and in analogous methods that distinguish SEQ ID NO: 18-34 from related 
polynucleotide sequences. The precise length of a fragment of SEQ ID NO: 1 8-34 and the region of 
SEQ ID NO: 1 8-34 to which the fragment corresponds are routinely determinable by one of ordinary 
skill in the art based on the intended purpose for the fragment. 

A fragment of SEQ ID NO: 1 - 1 7 is encoded by a fragment of SEQ ID NO: 1 8-34. A fragment 
of SEQ ID NO: 1-17 comprises a region of unique amino acid sequence that specifically identifies 
SEQ ID NO: 1-17. For example, a fragment of SEQ ID NO: 1-1 7 is useful as an immunogenic peptide 
for the development of antibodies that specifically recognize SEQ ID NO:l-17. The precise length of 
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a fragment of SEQ ID NO: 1-1 7 and the region of SEQ ID NO: 1-17 to which the fragment 
corresponds are routinely determinable by one of ordinary skill in the art based on the intended 
purpose for the fragment. 

The term ' similarity" refers to a degree of complementarity. There may be partial similarity or 
5 complete similarity. The word "identity" may substitute for the word ' similarity/' A partially 
complementary sequence that at least partially inhibits an identical sequence f rom hybridizing to a 
target nucleic acid is referred to as "substantially similar/* The inhibition of hybridization of the 
completely complementary sequence to the target sequence may be examined using a hybridization 
assay (Southern or northern blot, solution hybridization, and the like) under conditions of reduced 

10 stringency. A substantially similar sequence or hybridization probe will compete for and inhibit the 
binding of a completely similar (identical) sequence to the target sequence under conditions of reduced 
stringency. This is not to say that conditions of reduced stringency are such that non-specific binding is 
permitted, as reduced stringency conditions require that the binding of two sequences to one another be 
a specific (i.e., a selective) interaction. The absence of non-specific binding may be tested by the use of 

15 a second target sequence which lacks even a partial degree of complementarity (e.g., less than about 

30% similarity or identity). In the absence of non-specific binding, the substantially similar sequence or 
probe will not hybridize to the second non-complementary target sequence. 

The phrases "percent identity" and "% identity," as applied to polynucleotide sequences, refer 
to the percentage of residue matches between at least two polynucleotide sequences aligned using a 

20 standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps in 
the sequences being compared in order to optimize alignment between two sequences, and therefore 
achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEG ALIGN version 3.12e sequence 

25 alignment program. This program is part of the LASERGENE software package, a suite of molecular 
biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in Higgins, D.G. 
and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 8:189-191. 
For pairwise alignments of polynucleotide sequences, the default parameters are set as follows: 
Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue weight table is 

30 selected as the default. Percent identity is reported by CLUSTAL V as the "percent similarity" between 
aligned polynucleotide sequence pairs. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms is 
provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment Search 
Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available from several 

35 sources, including the NCBI, Bethesda, MD, and on the Internet at 
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http://www.ncbi.nlm.nih.cov/BLAST/. The BLAST software suite includes various sequence analysis 
programs including "blastn," that is used to align a known polynucleotide sequence with other 
polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
5 Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. The 
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.9 (May-07-1999) set at default parameters. Such default parameters may be, for example: 
10 Matrix: BLOSUM62 



Word Size: 11 
Filter: on 

Percent identity may be measured over the length of an entire defined sequence, for example, as 
defined by a particular SEQ ID number, or may be measured over a shorter length, for example, over 

20 the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at least 20, at 
least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous nucleotides. Such 
lengths are exemplary only, and it is understood that any fragment length supported by the sequences 
shown herein, in the tables, figures, or Sequence Listing, may be used to describe a length over which 
percentage identity may be measured. 

25 Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 

similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes in 
a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences 
that all encode substantially the same protein. 



30 the percentage of residue matches between at least two polypeptide sequences aligned using a 

standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment 
methods take into account conservative amino acid substitutions. Such conservative substitutions, 
explained in more detail above, generally preserve the hydrophobicity and acidity at the site of 
substitution, thus preserving the structure (and therefore function) of the polypeptide. 



15 



Reward for match: 1 

Penalty for mismatch: -2 

Open Gap: 5 and Extension Gap: 2 penalties 

Gap x drop-off: 50 

Expect: 10 



The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 
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Percent identity between polypeptide sequences may be determined using the default parameters 
of the CLUSTAL V algorithm as incorporated into the MEG ALIGN version 3. 12e sequence alignment 
program (described and referenced above). For pairwise alignments of polypeptide sequences usine 
CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap penalty=3, window=5, and 
"diagonals saved ' =5. The PAM250 matrix is selected as the default residue weight table. As with 
polynucleotide alignments, the percent identity is reported by CLUSTAL V as the "percent similarity" 
between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
comparison of two polypeptide sequences, one may use the ' BLAST 2 Sequences" tool Version 2.0.9 
(May-07-1999) with blastp set at default parameters. Such default parameters may be, for example: 

Matrix: BLOSUM62 

Open Gap: 1 1 and Extension Gap: 1 penalties 
Gap x drop-off: 50 
Expect: 10 
Word Size: 3 
Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, for 
example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for instance, 
a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 150 
contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length 
supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be used to 
describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements required for 
stable mitotic chromosome segregation and maintenance. 

The term "humanized antibody" refers to antibody molecules in which the amino acid sequence 
in the non-antigen binding regions has been altered so that the antibody more closely resembles a human 
antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of identity. Specific 
hybridization complexes form under permissive annealing conditions and remain hybridized after the 
"washing" step(s). The washing step(s) is particularly important in determining the stringency of the 
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hybridization process, with more stringent conditions allowing less non-specific binding, i.e., binding 
between pairs of nucleic acid strands that are not perfectly matched. Permissive conditions for 
annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in the art and 
may be consistent among hybridization experiments, whereas wash conditions may be varied among 
5 experiments to achieve the desired stringency, and therefore hybridization specificity. Permissive 
annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 1 % (w/v) 
SDS, and about 100 ug/ml denatured salmon sperm DNA. 

Generally, stringency of hybridization is expressed, in part, with reference to the temperature 
under which the wash step is carried out. Generally, such wash temperatures are selected to be about 

10 5°C to 20°C lower than the thermal melting point (T^ for the specific sequence at a defined ionic 

strength and pH. The T m is the temperature (under defined ionic strength and pH) at which 50% of the 
target sequence hybridizes to a perfectly matched probe. An equation for calculating T m and conditions 
for nucleic acid hybridization are well known and can be found in Sambrook et al., 1989, Molecular 
Cloning: A Laboratory Manual , 2 nd ed., vol. 1-3, Cold Spring Harbor Press, Plainview NY; specifically 

15 see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present invention 
include wash conditions ol 68°C in the presence of about 0.2 x SSC and about 0. 1 % SDS, for 1 hour. 
Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC concentration may 
be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. Typically, blocking 

20 reagents are used to block non-specific hybridization. Such blocking reagents include, for instance, 
denatured salmon sperm DNA at about 100-200 ug/ml. Organic solvent, such as formamide at a 
concentration of about 35-50% v/v, may also be used under particular circumstances, such as for 
RNAiDNA hybridizations. Useful variations on these wash conditions will be readily apparent to those 
of ordinary skill in the art. Hybridization, particularly under high stringency conditions, may be 

25 suggestive of evolutionary similarity between the nucleotides. Such similarity is strongly indicative of a 
similar role for the nucleotides and their encoded polypeptides. 

The term "hybridization complex" refers to a complex formed between two nucleic acid 
sequences by virtue of the formation of hydrogen bonds between complementary bases. A hybridization 
complex may be formed in solution (e.g., C 0 t or R 0 t analysis) or formed between one nucleic acid 

30 sequence present in solution and another nucleic acid sequence immobilized on a solid support (e.g., 
paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate to which cells 
or their nucleic acids have been fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide sequence 
resulting in the addition of one or more amino acid residues or nucleotides, respectively. 
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"Immune response * can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression of 
various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect cellular 
and systemic defense systems. 



capable of eliciting an immune response when introduced into a living organism, for example, a 
mammal. The term "immunogenic fragment 7 * also includes any polypeptide or oligopeptide fragment of 
HUMAP which is useful in any of the antibody production methods disclosed herein or known in the 
art. 



The terms "element" and "array element" in a microarray context, refer to hybridizable 
polynucleotides arranged on the surface of a substrate. 

The term "modulate" refers to a change in the activity of HUMAP. For example, modulation 
may cause an increase or a decrease in protein activity, binding characteristics, or any other biological, 

15 functional, or immunological properties of HUMAP. 

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RN A of genomic or 
synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material. 

20 "Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 

functional relationship with the second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Generally, operably linked DNA sequences may be in close proximity or contiguous and, 
where necessary to join two protein coding regions, in the same reading frame. 

25 "Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 

comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. PNAs 
preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, and 
may be pegylated to extend their lifespan in the cell. 

30 "Probe" refers to nucleic acid sequences encoding HUMAP, their complements, or fragments 

thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. Typical 
labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. "Primers" are 
short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by 

35 complementary base-pairing. The primer may then be extended along the target DNA strand by a DNA 



5 



An "immunogenic fragment" is a polypeptide or oligopeptide fragment of HUMAP which is 
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The term "microarray" refers to an arrangement of distinct polynucleotides on a substrate. 
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polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nucleic acid 
sequence, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
5 be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers may 
be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used. 

Methods for preparing and using probes and primers are described in the references, for 

10 example Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual , 2 nd ed., vol. 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubel et al.,1987, Current Protocols in Molecular Biology , 
Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis et al., 1990, PCR Protocols, A Guide 
to Methods and Applications , Academic Press, San Diego CA. PCR primer pairs can be derived from 
a known sequence, for example, by using computer programs intended for that purpose such as Primer 

15 (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLiGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 
nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection 

20 programs have incorporated additional features for expanded capabilities. For example, the PrimOU 
primer selection program (available to the public from the Genome Center at University of Texas South 
West Medical Center, Dallas TX) is capable of choosing specific primers from megabase sequences 
and is thus useful for designing primers on a genome-wide scope. The Primer3 primer selection 
program (available to the public from the Whitehead Institute/MIT Center for Genome Research, 

25 Cambridge MA) allows the user to input a "misprinting library," in which sequences to avoid as primer 
binding sites are user-specified. Primer3 is useful, in particular, for the selection of oligonucleotides for 
microarrays. (The source code for the latter two primer selection programs may also be obtained from 
their respective sources and modified to meet the user's specific needs.) The PrimeGen program 
(available to the public from the UK Human Genome Mapping Project Resource Centre, Cambridge 

30 UK) designs primers based on multiple sequence alignments, thereby allowing selection of primers that 
hybridize to either the most conserved or least conserved regions of aligned nucleic acid sequences. 
Hence, this program is useful for identification of both unique and conserved oligonucleotides and 
polynucleotide fragments. The oligonucleotides and polynucleotide fragments identified by any of the 
above selection methods are useful in hybridization technologies, for example, as PCR or sequencing 
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primers, microarray elements, or specific probes to identify fully or partially complementary 
polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited to 
those described above. 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 
5 that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
such as those described in Sambrook, supra . The term recombinant includes nucleic acids that have 
been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a 

10 recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter sequence. 
Such a recombinant nucleic acid may be pan of a vector that is used, for example, to transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
expressed, inducing a protective immunological response in the mammal. 

15 An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 

sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
instead of deoxyribose. 

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 

20 acids encoding HUMAP, or fragments thereof, or HUMAP itself, may comprise a bodily fluid; an 
extract from a cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, 
RNA, or cDNA, in solution or bound to a substrate: a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 

25 synthetic binding composition. The interaction is dependent upon the presence of a particular structure 
of the protein, e.g., the antigenic determinant or epitope, recognized by the binding molecule. For 
example, if an antibody is specific for epitope "A," the presence of a polypeptide containing the epitope 
A, or the presence of free unlabeled A, in a reaction containing free labeled A and the antibody will 
reduce the amount of labeled A that binds to the antibody. 

30 The term "substantially purified" refers to nucleic acid or amino acid sequences that are 

removed from their natural environment and are isolated or separated, and are at least 60% free, 
preferably at least 75% free, and most preferably at least 90% free from other components with which 
they are naturally associated 

A "substitution" refers to the replacement of one or more amino acids or nucleotides by 
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different amino acids or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

"Transformation'" describes a process by which exogenous DNA enters and changes a recipient 
cell. Transformation may occur under natural or artificial conditions according to various methods well 
known in the art, and may rely on any known method for the insertion of foreign nucleic acid sequences 
into a prokaryotic or eukaryotic host cell. The method for transformation is selected based on the type 
of host cell being transformed and may include, but is not limited to, viral infection, electroporation, 
heat shock, lipofection, and particle bombardment. The term "transformed" cells includes stably 
transformed cells in which the inserted DNA is capable of replication either as an autonomously 
replicating plasmid or as part of the host chromosome, as well as transiently transformed cells which 
express the inserted DNA or RN A for limited periods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to 
animals and plants, in which one or more of the cells of the organism contains heterologous nucleic 
acid introduced by way of human intervention, such as by transgenic techniques well known in the 
art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor 
of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with 
a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in 
vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The 
transgenic organisms contemplated in accordance with the present invention include bacteria, 
cyanobacteria, fungi, and plants and animals. The isolated DNA of the present invention can be 
introduced into the host by methods known in the art, for example infection, transfection, 
transformation or transconjugation. Techniques for transferring the DNA of the present invention 
into such organisms are widely known and provided in references such as Sambrook et al. (1989), 
supra, 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having at 
least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of the 
nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at least 
60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% or greater 
sequence identity over a certain defined length. A variant may be described as, for example, an "allelic" 
(as defined above), "splice," "species," or "polymorphic" variant. A splice variant may have significant 
identity to a reference molecule, but will generally have a greater or lesser number of polynucleotides 
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due to alternate splicing of exons during niRNA processing. The corresponding polypeptide may 
possess additional functional domains or lack domains that are present in the reference molecule. 
Species variants arc polynucleotide sequences that vary from one species to another. The resulting 
polypeptides generally will have significant amino acid identity relative to each other. A polymorphic 
5 variant is a variation in the polynucleotide sequence of a particular gene between individuals of a given 
species. Polymorphic variants also may encompass "single nucleotide polymorphisms" (SNPs) in 
which the polynucleotide sequence varies by one nucleotide base. The presence of SNPs may be 
indicative of, for example, a certain population, a disease state, or a propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having at 

10 least 40% sequence identity to the particular polypeptide sequence over a certain length of one of the 
polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-l999) 
set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at least 
60%, at least 707o, at least 80%, at least 90%, at least 95%, or at least 98% or greater sequence 
identity over a certain defined length of one of the polypeptides. 

15 THE INVENTION 

The invention is based on the discovery of new human membrane-associated proteins 
(HUMAP), the polynucleotides encoding HUMAP, and the use of these compositions for the diagnosis, 
treatment, or prevention of cell signaling, cell differentiation, and cell proliferation disorders. 

Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding 

20 HUMAP. Columns 1 and 2 show the sequence identification numbers (SEQ ID NOs) of the 

polypeptide and nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte clones 
in which nucleic acids encoding each HUMAP were identified, and column 4 shows the cDNA libraries 
from which these clones were isolated. Column 5 shows Incyte clones and their corresponding cDN A 
libraries. Clones for which cDNA libraries are not indicated were derived from pooled cDNA libraries. 

25 The Incyte clones in column 5 were used to assemble the consensus nucleotide sequence of each 
HUMAP and are useful as fragments in hybridization technologies. 

The columns of Table 2 show various properties of each of the polypeptides of the invention: 
column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each 
polypeptide; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylation 

30 sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6 

shows homologous sequences as identified by BLAST analysis; and column 7 shows analytical methods 
and in some cases, searchable databases to which the analytical methods were applied. The methods of 
column 7 were used to characterize each polypeptide through sequence homology and protein motifs. 
The columns of Table 3 show trie tissue-specificity and diseases, disorders, or conditions 

35 associated with nucleotide sequences encoding HUMAP. The first column of Table 3 lists the 
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nucleotide SEQ ID NOs. Column 2 lists fragments of the nucleotide sequences of column 1 . These 
fragments are useful, for example, in hybridization or amplification technologies to identify SEQ ID 
NO: 18-34 and to distinguish between SEQ ID NO: 18-34 and related polynucleotide sequences. The 
polypeptides encoded by these fragments are useful, for example, as immunogenic peptides. Column 3 
5 lists tissue categories which express HUM AP as a fraction of total tissues expressing HUMAP. 

Column 4 lists diseases, disorders, or conditions associated with those tissues expressing HUMAP as a 
fraction of total tissues expressing HUMAP. Column 5 lists the vectors used to subclone each cDNA 
library. 

The columns of Table 4 show descriptions of the tissues used to construct the cDN A libraries 
10 from which cDNA clones encoding HUMAP were isolated. Column 1 references the nucleotide SEQ 
ID NOs, column 2 shows the cDNA libraries from which these clones were isolated, and column 3 
shows the tissue origins and other descriptive information relevant to the cDNA libraries in column 2. 

SEQ ID NO:31 maps to chromosome 4 within the interval from 77.30 to 88.50 centiMorgans. 
The invention also encompasses HUMAP variants. A preferred HUMAP variant is one which 
15 has at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid 
sequence identity to the HUMAP amino acid sequence, and which contains at least one functional or 
structural characteristic of HUMAP. 

The invention also encompasses polynucleotides which encode HUMAP. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected from 
20 the group consisting of SEQ ID NO: 18-34, which encodes HUMAP. The polynucleotide sequences of 
SEQ ID NO: 18-34, as presented in the Sequence Listing, embrace the equivalent RNA sequences, 
wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone 
is composed of ribose instead of deoxyribose. 

The invention also encompasses a variant of a polynucleotide sequence encoding HUMAP, In 
25 particular, such a variant polynucleotide sequence will have at least about 80%, or alternatively at least 
about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide sequence 
encoding HUMAP. A particular aspect of the invention encompasses a variant of a polynucleotide 
sequence comprising a sequence selected from the group consisting of SEQ ID NO: 18-34 which has at 
least about 80%, or alternatively at least about 85%, or even at least about 95% polynucleotide 
30 sequence identity to a nucleic acid sequence selected from the group consisting of SEQ ID NO: 18-34. 
Any one of the polynucleotide variants described above can encode an amino acid sequence which 
contains at least one functional or structural characteristic of HUMAP. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the genetic 
code, a multitude of polynucleotide sequences encoding HUMAP, some bearing minimal similarity to 
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the polynucleotide sequences of any known and naturally occurring gene, may be produced. Thus, the 
invention contemplates each and every possible variation of polynucleotide sequence that could be made 
by selecting combinations based on possible codon choices. These combinations are made in 
accordance with the standard triplet genetic code as applied to the polynucleotide sequence of naturally 
5 occurring HUMAP, and all such variations are to be considered as being specifically disclosed. 

Although nucleotide sequences which encode HUMAP and its variants are generally capable of 
hybridizing to the nucleotide sequence of the naturally occurring HUMAP under appropriately selected 
conditions of stringency, it may be advantageous to produce nucleotide sequences encoding HUMAP or 
its derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally 

10 occurring codons. Codons may be selected to increase the rate at which expression of the peptide 
occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with which 
particular codons are utilized by the host. Other reasons for substantially altering the nucleotide 
sequence encoding HUMAP and its derivatives without altering the encoded amino acid sequences 
include the production of RNA transcripts having more desirable properties, such as a greater half-life, 

15 than transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode HUMAP and 
HUMAP derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell systems 
using reagents well known in the art. Moreover, synthetic chemistry may be used to introduce 

20 mutations into a sequence encoding HUMAP or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:18-34 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and 
S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 

25 152:507-5 11.) Hybridization conditions, including annealing and wash conditions, are described in 
"Definitions." 

Methods for DNA sequencing are well known in the art and may be used to practice any of the 
embodiments of the invention. The methods may employ such enzymes as the Klenow fragment of 
DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Perkin-Elmer), 
30 thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or combinations of 
polymerases and proofreading exonucleases such as those found in the ELONGASE amplification 
system (Life Technologies, Gaithersburg MD). Preferably, sequence preparation is automated with 
machines such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV), PTC200 thermal 
cycler (MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler (Perkin-Elmer). 
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Sequencing is then carried out using either the ABI 373 or 377 DNA sequencing system (Perkin- 
Elmer), the MEG AB ACE 1000 DNA sequencing system (Molecular Dynamics, Sunnyvale CA). or 
other systems known in the art. The resulting sequences are analyzed using a variety of algorithms 
which are well known in the an. (See, e.g., Ausubel, F.M. (1997) Short Protocols in Molecular 
Biology . John Wiley & Sons, New York NY, unit 7.7; Meyers, R.A. (1995) Molecular Biology and 
Biotechnology . Wiley VCH, New York NY, pp. 856-853.) 

The nucleic acid sequences encoding HUMAP may be extended utilizing a partial nucleotide 
sequence and employing various PCR-based methods known in the art to detect upstream sequences, 
such as promoters and regulatory elements. For example, one method which may be employed, 
restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 
DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.) 
Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown 
sequence from a circularized template. The template is derived from restriction fragments comprising a 
known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids 
Res. 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments adjacent 
to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al. 
(1991) PCR Methods Applic. 1:111-119.) In this method, multiple restriction enzyme digestions and 
ligations may be used to insert an engineered double-stranded sequence into a region of unknown 
sequence before pa-forming PCR. Other methods which may be used to retrieve unknown sequences 
are known in the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060). 
Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries (Clontech, Palo 
Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in 
finding intron/exon junctions. For all PCR-based methods, primers may be designed using 
commercially available software, such as OLIGO 4.06 Primer Analysis software (National Biosciences, 
Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in length, to have a 
GC content of about 50% or more, and to anneal to the template at temperatures of about 68°C to 
72°C. 

When screening for full-length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) library 
does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence into 5' 
non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze the 
size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
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sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Perkin-Elmer), and the entire process 
from loading of samples to computer analysis and electronic data display may he computer controlled. 
Capillary electrophoresis is especially preferable for sequencing small DNA fragments which may be 
present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof which 
encode HUMAP may be cloned in recombinant DNA molecules that direct expression of HUMAP, or 
fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy of 
the genetic code, other DNA sequences which encode substantially the same or a functionally equivalent 
amino acid sequence may be produced and used to express HUMAP. 

The nucleotide sequences of the present invention can be engineered using methods generally 
known in the art in order to alter HUM AP-encoding sequences for a variety of purposes including, but 
not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 
oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide- 
mediated site-directed mutagenesis may be used to introduce mutations that create new restriction sites, 
alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques such 
as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat. 
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or 
improve the biological properties of HUMAP, such as its biological or enzymatic activity or its 
ability to bind to other molecules or compounds. DNA shuffling is a process by which a library of 
gene variants is produced using PCR-mediated recombination of gene fragments. The library is then 
subjected to selection or screening procedures that identify those gene variants with the desired 
properties. These preferred variants may then be pooled and further subjected to recursive rounds of 
DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial** 
breeding and rapid molecular evolution. For example, fragments of a single gene containing random 
point mutations may be recombined, screened, and then reshuffled until the desired properties are 
optimized. Alternatively, fragments of a given gene may be recombined with fragments of 
homologous genes in the same gene family, either from the same or different species, thereby 
maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable 
manner. 
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In another embodiment, sequences encoding HUMAP may be synthesized, in whole or in part, 
using chemical methods well known in the art. (See, e.g.. Caruthers, M.H. et al. (1980) Nucleic Acids 
Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) Alternatively, 
HUMAP itself or a fragment thereof may be synthesized using chemical methods. For example, peptide 
synthesis can be performed using various solid-phase techniques. (See, e.g., Roberge, J.Y. et al. (1995) 
Science 269:202-204.) Automated synthesis may be achieved using the ABI 431 A peptide synthesizer 
(Perkin-Elmer). Additionally, the amino acid sequence of HUMAP, or any part thereof, may be altered 
during direct synthesis and/or combined with sequences from other proteins, or any part thereof, to 
produce a variant polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.) 
The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing. 
(See, e.g., Creighton, T. (1984) Proteins. Structures and Molecular Properties . WH Freeman, New 
York NY.) 

In order to express a biologically active HUMAP, the nucleotide sequences encoding HUMAP 
or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which 
contains the necessary elements for transcriptional and translational control of the inserted coding 
sequence in a suitable host. These elements include regulatory sequences, such as enhancers, 
constitutive and inducible promoters, and 5' and 3 ? untranslated regions in the vector and in 
polynucleotide sequences encoding HUMAP. Such elements may vary in their strength and specificity. 
Specific initiation signals may also be used to achieve more efficient translation of sequences encoding 
HUMAP. Such signals include the ATG initiation codon and adjacent sequences, e.g. the Kozak 
sequence. In cases where sequences encoding HUMAP and its initiation codon and upstream regulatory 
sequences are inserted into the appropriate expression vector, no additional transcriptional or 
translational control signals may be needed. However, in cases where only coding sequence, or a 
fragment thereof, is inserted, exogenous translational control signals including an in-frame ATG 
initiation codon should be provided by the vector. Exogenous translational elements and initiation 
codons may be of various origins, both natural and synthetic. The efficiency of expression may be 
enhanced by the inclusion of enhancers appropriate for the particular host cell system used. (See, e.g., 
Scharf, D. et al. (1994) Results Probl. Cell Differ. 20:125-162.) 

Methods which are well known to those skilled in the an may be used to construct expression 
vectors containing sequences encoding HUMAP and appropriate transcriptional and translational 
control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, 
and in vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning. A 
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Laboratory Manual , Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et 
al. (1995) Current Protocols in Molecular Biology , John Wiley & Sons, New York NY, ch. 9, 13, and 
16.) 

A variety of expression vector/host systems may be utilized to contain and express sequences 
5 encoding HUMAP. These include, but are not limited to, microorganisms such as bacteria transformed 
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus); 
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or 
tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 

10 animal cell systems. The invention is not limited by the host cell employed. 

In bacteria] systems, a number of cloning and expression vectors may be selected depending 
upon the use intended for polynucleotide sequences encoding HUMAP. For example, routine cloning, 
subcloning, and propagation of polynucleotide sequences encoding HUMAP can be achieved using a 
multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 plasmid 

15 (Life Technologies). Ligation of sequences encoding HUMAP into the vector's multiple cloning site 
disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of transformed 
bacteria containing recombinant molecules. In addition, these vectors may be useful for in vitro 
transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of nested 
deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 

20 264:5503-5509.) When large quantities of HUMAP are needed, e.g. for the production of antibodies, 
vectors which direct high level expression of HUMAP may be used. For example, vectors containing 
the strong, inducible T5 or T7 bacteriophage promoter may be used. 

Yeast expression systems may be used for production of HUMAP. A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 

25 promoters, may be used in the yeast Saccharomyces cerevisiae or Pichia pastoris . In addition, such 
vectors direct either the secretion or intracellular retention of expressed proteins and enable integration 
of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 1995, supra ; 
Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; and Scorer, C.A. et al. (1994) 
Bio/Technology 12:181-184.) 

30 Plant systems may also be used for expression of HUMAP. Transcription of sequences 

encoding HUMAP may be driven viral promoters, e.g., the 35S and 19S promoters of CaMV used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 
6:307-31 1). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock 
promoters may be used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. 
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(1984) Science 224:838-843: and Winter, J. et al. (1991 ) Results Probl. Cell Differ. 17:85-105.) These 
constructs can be introduced into plant cells by direct DNA transformation or pathogen-mediated 
transfection. (See, e.g., The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, 
New York NY, pp. 191-196.) 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, sequences encoding HUMAP may be ligated into 
an adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to obtain 
infective virus which expresses HUMAP in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc. 
Natl. Acad. Sci. USA 81 :3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma 
virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV- 
based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino polymers, 
or vesicles) for therapeutic purposes. (See, e.g., Harrington, J J. et al. (1997) Nat. Genet. 15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable expression of 
HUMAP in cell lines is preferred. For example, sequences encoding HUMAP can be transformed into 
cell lines using expression vectors which may contain viral origins of replication and/or endogenous 
expression elements and a selectable marker gene on the same or on a separate vector. Following the 
introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media before 
being switched to selective media. The purpose of the selectable marker is to confer resistance to a 
selective agent, and its presence allows growth and recovery of cells which successfully express the 
introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue 
culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These include, 
but are not limited to, the herpes simplex virus thymidine kinase and adenine phosphoribosyltransferase 
genes, for use in Or and apr cells, respectively. (See, e.g., Wigler, M. et al. (1977) Cell 1 1 :223-232; 
Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, or herbicide resistance can be 
used as the basis for selection. For example, dhfr confers resistance to methotrexate; neo confers 
resistance to the aminoglycosides neomycin and G-418; and als and pat confer resistance to 
chloTsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., Wigler, M. et al. (1980) 
Proc. Natl. Acad Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14.) 
Additional selectable genes have been described, e.g., trpB and hisD, which alter cellular requirements 
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lor metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. USA 
85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech), 6 
glucuronidase and its substrate 6-glucuronide, or luciferase and its substrate luciferin may be used. 
These markers can be used not only to identify transformants, but also to quantify the amount of 
transient or stable protein expression attributable to a specific vector system. (See, e.g., Rhodes, C.A. 
(1995) Methods Mol. Biol. 55.121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of interest is 
also present, the presence and expression of the gene may need to be confirmed. For example, if the 
sequence encoding HUMAP is inserted within a marker gene sequence, transformed cells containing 
sequences encoding HUMAP can be identified by the absence of marker gene function. Alternatively, a 
marker gene can be placed in tandem with a sequence encoding HUMAP under the control of a single 
promoter. Expression of the marker gene in response to induction or selection usually indicates 
expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding HUMAP and that express 
HUMAP may be identified by a variety of procedures known to those of skill in the art. These 
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or 
chip based technologies for the detection and/or quantification of nucleic acid or protein sequences. 

Immunological methods for detecting and measuring the expression of HUMAP using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques include 
enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence 
activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering epitopes on HUMAP is preferred, but a competitive binding 
assay may be employed. These and other assays are well known in the art. (See, e.g., Hampton, R. et 
al. (1990) Serological Methods, a Laboratory Manual . APS Press, St. Paul MN, Sect. IV; Coligan, J.E. 
et al. (1997) Current Protocols in Immunology , Greene Pub. Associates and Wiley-Interscience, New 
York NY; and Pound, J.D. (1998) Immunochemical Protocols . Humana Press, Totowa NJ.) 

A wide variety of labels and conjugation techniques are known by those skilled in the art and 
may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization 
or PCR probes for detecting sequences related to polynucleotides encoding HUMAP include 
oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide. 
Alternatively, the sequences encoding HUMAP, or any fragments thereof, may be cloned into a vector 
for the production of an mRNA probe. Such vectors are known in the art, are commercially available, 
and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase 
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such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety of 
commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega 
(Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for ease 
of detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as 
well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding HUMAP may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cell may be secreted or retained intracellularly depending on the sequence 
and/or the vector used. As will be understood by those of skill in the art, expression vectors containing 
polynucleotides which encode HUMAP may be designed to contain signal sequences which direct 
secretion of HUMAP through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the 
polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, 
lipidation, and acylation. Post-translational processing which cleaves a "prepro" or "pro" form of the 
protein may also be used to specify protein targeting, folding, and/or activity. Different host cells 
which have specific cellular machinery and characteristic mechanisms for post-translational activities 
(e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the American Type Culture 
Collection (ATCC, Manassas VA) and may be chosen to ensure the correct modification and processing 
of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding HUMAP may be ligated to a heterologous sequence resulting in translation of a 
fusion protein in any of the aforementioned host systems. For example, a chimeric HUMAP protein 
containing a heterologous moiety that can be recognized by a commercially available antibody may 
facilitate the screening of peptide libraries for inhibitors of HUMAP activity. Heterologous protein and 
peptide moieties may also facilitate purification of fusion proteins using commercially available affinity 
matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), maltose 
binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, and 
hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate fusion 
proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate resins, 
respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity purification of fusion 
proteins using commercially available monoclonal and polyclonal antibodies that specifically recognize 
these epitope tags. A fusion protein may also be engineered to contain a proteolytic cleavage site 
located between the HUMAP encoding sequence and the heterologous protein sequence, so that 
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HUMAP may be cleaved away from the heterologous moiety following purification. Methods for 
fusion protein expression and purification are discussed in Ausubel (1995, supra , ch. 10). A variety of 
commercially available kits may also be used to facilitate expression and purification of fusion proteins. 
In a further embodiment of the invention, synthesis of radiolabeled HUMAP may be achieved in 
5 vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems 
couple transcription and translation of protein-coding sequences operably associated with the T7, T3, or 
SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid precursor, for 
example, 35 S-methionine. 

Fragments of HUMAP may be produced not only by recombinant means, but also by direct 
10 peptide synthesis using solid-phase techniques. (See, e.g., Creighton, supra, pp. 55-60.) Protein 
synthesis may be performed by manual techniques or by automation. Automated synthesis may be 
achieved, for example, using the ABI 431 A peptide synthesizer (Perkin-Elmer). Various fragments of 
HUMAP may be synthesized separately and then combined to produce the full length molecule. 
THERAPEUTICS 

15 Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 

between regions of HUMAP and human membrane-associated proteins. In addition, the expression of 
HUMAP is closely associated with cell proliferation, cancer, and inflammation. Therefore, HUMAP 
appears to play a role in cell signaling, cell differentiation, and cell proliferation disorders. In the 
treatment of disorders associated with increased HUMAP expression or activity, it is desirable to 

20 decrease the expression or activity of HUMAP. In the treatment of disorders associated with 
decreased HUMAP expression or activity, it is desirable to increase the expression or activity of 
HUMAP. 

Therefore, in one embodiment, HUMAP or a fragment or derivative thereof may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 

25 activity of HUMAP. Examples of such disorders include, but are not limited to, a cell proliferative 
disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed 
connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, 
polycythemia vera, psoriasis, primary thrombocythemia, and cancers including adenocarcinoma, 
leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of 

30 the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, 

gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, 
salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; a cell differentiation disorder 
including developmental disorders such as renal tubular acidosis, anemia, Cushing's syndrome, 
achondroplastic dwarfism, Duchenne and Becker muscular dystrophy, epilepsy, gonadal dysgenesis, 

35 WAGR syndrome (Wilms' tumor, aniridia, genitourinary abnormalities, and mental retardation), 
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Smith-Magerus syndrome, myelodysplasia syndrome, hereditary mucoepithelial dysplasia, hereditary 
keratodermas, hereditary neuropathies such as Charcot-Marie-Tooth disease and neurofibromatosis, 
hypothyroidism, hydrocephalus, seizure disorders such as Syndenham's chorea and cerebral palsy, 
spina bifida, anencephaly, craniorachischisis, congenital glaucoma, cataract, and sensorineural hearing 
5 loss and disorders of immune cell activation; and a disorder of cell signaling including endocrine 
disorders such as disorders of the hypothalamus and pituitary resulting from lesions such as primary 
brain tumors, adenomas, infarction associated with pregnancy, hypophysectomy. aneurysms, vascular 
malformations, thrombosis, infections, immunological disorders, and complications due to head 
trauma; disorders associated with hyperpituitarism including acromegaly, giantism, and syndrome of 

10 inappropriate antidiuretic hormone (ADH) secretion (SIADH) often caused by benign adenoma; 
disorders associated with hypothyroidism including goiter, myxedema, acute thyroiditis associated 
with bacterial infection; disorders associated with hyperparathyroidism including Conn disease 
(chronic hypercalemia); pancreatic disorders such as Type I or Type II diabetes mellitus and 
associated complications; disorders associated with the adrenals such as hyperplasia, carcinoma, or 

15 adenoma of the adrenal cortex, hypertension associated with alkalosis; disorders associated with 
gonadal steroid hormones such as: in women, abnormal prolactin production, infertility, 
endometriosis, perturbations of the menstrual cycle, polycystic ovarian disease, hyperprolactinemia, 
isolated gonadotropin deficiency, amenorrhea, galactorrhea, hermaphroditism, hirsutism and 
virilization, breast cancer, and, in postmenopausal women, osteoporosis; and, in men, Leydig cell 

20 deficiency, male climacteric phase, and germinal cell aplasia, hypergonadal disorders associated with 
Leydig cell tumors, androgen resistance associated with absence of androgen receptors, syndrome of 5 
a -reductase, and gynecomastia. 

In another embodiment, a vector capable of expressing HUMAP or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased 

25 expression or activity of HUMAP including, but not limited to, those described above. 

In a further embodiment, a pharmaceutical composition comprising a substantially purified 
HUMAP in conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat 
or prevent a disorder associated with decreased expression or activity of HUMAP including, but not 
limited to, those provided above. 

30 In still another embodiment, an agonist which modulates the activity of HUMAP may be 

administered to a subject to treat or prevent a disorder associated with decreased expression or activity 
of HUMAP including, but not limited to, those listed above. 

In a further embodiment, an antagonist of HUMAP may be administered to a subject to treat or 
prevent a disorder associated with increased expression or activity of HUMAP. Examples of such 

35 disorders include, but are not limited to, those cell signaling, cell differentiation, and cell proliferation 
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disorders described above. In one aspect, an antibody which specifically binds HUMAP may be used 
directly as an antagonist or indirectly as a targeting or delivery mechanism for bringing a 
pharmaceutical agent to cells or tissues which express HUMAP. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding HUMAP may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of HUMAP including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
sequences, or vectors of the invention may be administered in combination with other appropriate 
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made by 
one of ordinary skill in the art, according to conventional pharmaceutical principles. The combination 
of therapeutic agents may act synergistically to effect the treatment or prevention of the various 
disorders described above. Using this approach, one may be able to achieve therapeutic efficacy with 
lower dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of HUMAP may be produced using methods which are generally known in the 
art. In particular, purified HUMAP may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind HUMAP. Antibodies to HUMAP may 
also be generated using methods that are well known in the art. Such antibodies may include, but are 
not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and 
fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit 
dimer formation) are generally preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 
and others may be immunized by injection with HUMAP or with any fragment or oligopeptide thereof 
which has immunogenic properties. Depending on the host species, various adjuvants may be used to 
increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral gels 
such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG 
(bacilli Calmette-Guerin) and Corvnebacterium parvum are especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 
HUMAP have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or 
fragments are identical to a portion of the amino acid sequence of the natural protein and contain the 
entire amino acid sequence of a small, naturally occurring molecule. Short stretches of HUMAP amino 
acids may be fused with those of another protein, such as KLH, and antibodies to the chimeric molecule 
may be produced. 
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Monoclonal antibodies to HUMAP may be prepared using any technique which provides lor 
the production of antibody molecules by continuous cell lines in culture. These include, but are not 
limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma 
technique. (See, e.g., KohJer, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. 
5 Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and 
Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc. 

10 Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda, 
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single 
chain antibodies may be adapted, using methods known in the art, to produce HUMAP-specific single 
chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be 
generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g., Burton, 

15 D.R. (1991) Proc. Natl. Acad. Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte population 
or by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in 
the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 86:3833-3837; Winter, 
G. et al. (1991) Nature 349:293-299.) 

20 Antibody fragments which contain specific binding sites for HUMAP may also be generated. 

For example, such fragments include, but are not limited to, F(ab') 2 fragments produced by pepsin 
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of the 
F(ab*)2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and easy 
identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. et al. 

25 (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation between HUMAP and its 

30 specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive 
to two non-interfering HUMAP epitopes is generally used, but a competitive binding assay may also be 
employed (Pound, supra) . 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques 
may be used to assess the affinity of antibodies for HUMAP. Affinity is expressed as an association 
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constant, Ka, which is defined as the molar concentration of HUM AP-antibody complex divided by the 
molar concentrations of free antigen and free antibody under equilibrium conditions. The determined 
for a preparation of polyclonal antibodies, which are heterogeneous in their affinities for multiple 
HUMAP epitopes, represents the average affinity, or avidity, of the antibodies for HUMAP. The 
5 determined for a preparation of monoclonal antibodies, which are monospecific for a particular 

HUMAP epitope, represents a true measure of affinity. High-affinity antibody preparations with 1^ 
ranging from about 10 9 to 10 i2 L/mole are preferred for use in immunoassays in which the HUMAP- 
antibody complex must withstand rigorous manipulations. Low-affinity antibody preparations with 
ranging from about 10 6 to 10 7 L/mole are preferred for use in immunopurification and similar 
10 procedures which ultimately require dissociation of HUMAP, preferably in active form, from the 

antibody (Catty, D. (1988) Antibodies, Volume I: A Practical Approach , IRL Press, Washington, DC; 
Liddell, J.E. and Cryer, A. (1991) A Practical Guide to Monoclonal Antibodies , John Wiley & Sons, 
New York NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to determine 

15 the quality and suitability of such preparations for certain downstream applications. For example, a 
polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg 
specific antibody/ml, is generally employed in procedures requiring precipitation of HUMAP-antibody 
complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for 
antibody quality and usage in various applications, are generally available. (See, e.g., Catty, supra , and 

20 Coligan et al. supra .) 

In another embodiment of the invention, the polynucleotides encoding HUMAP, or any 
fragment or complement thereof, may be used for therapeutic purposes. In one aspect, the complement 
of the polynucleotide encoding HUMAP may be used in situations in which it would be desirable to 
block the transcription of the mRNA. In particular, cells may be transformed with sequences 

25 complementary to polynucleotides encoding HUMAP. Thus, complementary molecules or fragments 
may be used to modulate HUMAP activity, or to achieve regulation of gene function. Such technology 
is now well known in the art, and sense or antisense oligonucleotides or larger fragments can be 
designed from various locations along the coding or control regions of sequences encoding HUMAP. 
Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, or 

30 from various bacterial plasmids, may be used for delivery of nucleotide sequences to the targeted organ, 
tissue, or cell population. Methods which are well known to those skilled in the art can be used to 
construct vectors to express nucleic acid sequences complementary to the polynucleotides encoding 
HUMAP. (See, e.g., Sambrook, supra ; Ausubel, 1995, supra .) 

Genes encoding HUMAP can be turned off by transforming a cell or tissue with expression 
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vectors which express high levels of a polynucleotide, or fragment thereof, encoding HUM AP. Such 
constructs may be used to introduce untranslatable sense or antisense sequences into a cell. Even in the 
absence of integration into the DNA, such vectors may continue to transcribe RNA molecules until they 
are disabled by endogenous nucleases. Transient expression may last for a month or more with a 
5 non-replicating vector, and may last even longer if appropriate replication elements are part of the 
vector system. 

As mentioned above, modifications of gene expression can be obtained by designing 
complementary sequences or antisense molecules (DNA, RNA, or PNA) to the control, 5', or regulatory 
regions of the gene encoding HUMAP. Oligonucleotides derived from the transcription initiation site, 

10 e.g., between about positions -10 and +10 from the start site, may be employed. Similarly, inhibition 
can be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it 
causes inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, 
transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have 
been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, B.E. and B.I. Carr, 

15 Molecular and Immunologic Approaches , Futura Publishing, Mt. Kisco NY, pp. 163-177.) A 

complementary sequence or antisense molecule may also be designed to block translation of mRNA by 
preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 

20 molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding HUMAP. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 

25 GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 

30 Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared by 

any method known in the art for the synthesis of nucleic acid molecules. These include techniques for 
chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA sequences 
encoding HUMAP. Such DNA sequences may be incorporated into a wide variety of vectors with 
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suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that 
synthesize complementary RNA, constitulively or inducibly, can be introduced into cell lines, cells, or 
tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of Hanking sequences at the 5* and/or 3* ends 
of the molecule, or the use of phosphorothioate or 2" O-methyl rather than phosphodiesterase linkages 
within the backbone of the molecule. This concept is inherent in the production of PNAs and can be 
extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, and 
wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, 
guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases. 

Many methods for introducing vectors into cells or tissues are available and equally suitable for 
use in vivo, in vitro , and ex vivo . For ex vivo therapy, vectors may be introduced into stem cells taken 
from the patient and clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat. 
Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of such 
therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

An additional embodiment of the invention relates to the administration of a pharmaceutical or 
sterile composition, in conjunction with a pharmaceutical^ acceptable carrier, for any of the therapeutic 
effects discussed above. Such pharmaceutical compositions may consist of HUMAP, antibodies to 
HUMAP, and mimetics, agonists, antagonists, or inhibitors of HUMAP. The compositions may be 
administered alone or in combination with at least one other agent, such as a stabilizing compound, 
which may be administered in any sterile, biocompatible pharmaceutical carrier including, but not 
limited to, saline, buffered saline, dextrose, and water. The compositions may be administered to a 
patient alone, or in combination with other agents, drugs, or hormones. 

The pharmaceutical compositions utilized in this invention may be administered by any number 
of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, 
intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, 
sublingual, or rectal means. 

In addition to the active ingredients, these pharmaceutical compositions may contain suitable 
pharmaceutically-acceptable carriers comprising excipients and auxiliaries which facilitate processing 
of the active compounds into preparations which can be used pharmaceutical^. Further details on 
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techniques tor formulation and administration may be found in the latest edition of Remington's 
Pharmaceutical Sciences (Maack Publishing, Easton PA). 

Pharmaceutical compositions for oral administration can be formulated using pharmaceutical^ 
acceptable carriers well known in the art in dosages suitable for oral administration. Such carriers 
5 enable the pharmaceutical compositions to be formulated as tablets, pills, dragees, capsules, liquids, 
gels, syrups, slurries, suspensions, and the like, for ingestion by the patient. 

Pharmaceutical preparations for oral use can be obtained through combining active compounds 
with solid excipient and processing the resultant mixture of granules (optionally, after grinding) to 
obtain tablets or dragee cores. Suitable auxiliaries can be added, if desired. Suitable excipients include 

10 carbohydrate or protein fillers, such as sugars, including lactose, sucrose, mannitol, and sorbitol; starch 
from corn, wheat, rice, potato, or other plants: cellulose, such as methyl cellulose, 
hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums, including arabic and 
tragacanth; and proteins, such as gelatin and collagen. If desired, disintegrating or solubilizing agents 
may be added, such as the cross-linked polyvinyl pyrrolidone, agar, and alginic acid or a salt thereof, 

15 such as sodium alginate. 

Dragee cores may be used in conjunction with suitable coatings, such as concentrated sugar 
solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, polyethylene 
glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures. 
Dyestuffs or pigments may be added to the tablets or dragee coatings for product identification or to 

20 characterize the quantity of active compound, i.e., dosage. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 
gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol. 
Push-fit capsules can contain active ingredients mixed with fillers or binders, such as lactose or 
starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft capsules, 

25 the active compounds may be dissolved or suspended in suitable liquids, such as fatty oils, liquid, or 
liquid polyethylene glycol with or without stabilizers. 

Pharmaceutical formulations suitable for parenteral administration may be formulated in 
aqueous solutions, preferably in physiologically compatible buffers such as Hanks' solution, Ringer's 
solution, or physiologically buffered saline. Aqueous injection suspensions may contain substances 

30 which increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or 
dextran. Additionally, suspensions of the active compounds may be prepared as appropriate oily 
injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils, such as sesame oil, or 
synthetic fatty acid esters, such as ethyl oleate, triglycerides, or liposomes. Non-lipid polycationic 
amino polymers may also be used for delivery. Optionally, the suspension may also contain suitable 
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stabilizers or agents to increase the solubility of the compounds and allow for the preparation of highly 
concentrated solutions. 

For topical or nasal administration, penetrants appropriate to the particular barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the an. 

The pharmaceutical compositions of the present invention may be manufactured in a manner 
that is known in the art, e.g., by means of conventional mixing, dissolving, granulating, dragee-making, 
levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes. 

The pharmaceutical composition may be provided as a salt and can be formed with many acids, 
including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, and succinic acids. 
Salts tend to be more soluble in aqueous or other protonic solvents than are the corresponding free base 
forms. In other cases, the preparation may be a lyophilized powder which may contain any or all of the 
following: 1 mM to 50 mM histidine, 0.1% to 2% sucrose, and 2% to 7% mannitol, at a pH range of 
4.5 to 5.5, that is combined with buffer prior to use. 

After pharmaceutical compositions have been prepared, they can be placed in an appropriate 
container and labeled for treatment of an indicated condition. For administration of HUMAP, such 
labeling would include amount, frequency, and method of administration. 

Pharmaceutical compositions suitable for use in the invention include compositions wherein the 
active ingredients are contained in an effective amount to achieve the intended purpose. The 
determination of an effective dose is well within the capability of those skilled in the art. 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, or pigs. 
An animal model may also be used to determine the appropriate concentration range and route of 
administration. Such information can then be used to determine useful doses and routes for 
administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example 
HUMAP or fragments thereof, antibodies of HUMAP, and agonists, antagonists or inhibitors of 
HUMAP, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be 
determined by standard pharmaceutical procedures in cell cultures or with experimental animals, such 
as by calculating the ED 50 (the dose therapeutically effective in 50% of the population) or LD 50 (the 
dose lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the LD 50 /ED 50 ratio. Pharmaceutical compositions which 
exhibit large therapeutic indices are preferred. The data obtained from cell culture assays and animal 
studies are used to formulate a range of dosage for human use. The dosage contained in such 
compositions is preferably within a range of circulating concentrations that includes the ED 50 with little 
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or no toxicity. The dosage varies within this range depending upon the dosage form employed, the 
sensitivity of the patient, and the route of administration. 

The exact dosage will be determined by the practitioner, in light of factors related to the subject 
requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the active 
5 moiety or to maintain the desired effect. Factors which may be taken into account include the severity 
of the disease state, the general health of the subject, the age, weight, and gender of the subject, time 
and frequency of administration, drug combination(s), reaction sensitivities, and response to therapy. 
Long-acting pharmaceutical compositions may be administered every 3 to 4 days, every week, or 
biweekly depending on the half-life and clearance rate of the particular formulation. 

10 Normal dosage amounts may vary from about 0. 1 ^g to 100,000 ^g, up to a total dose of 

about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 

15 conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind HUMAP may be used for the 
diagnosis of disorders characterized by expression of HUMAP, or in assays to monitor patients being 
treated with HUMAP or agonists, antagonists, or inhibitors of HUMAP. Antibodies useful for 

20 diagnostic purposes may be prepared in the same manner as described above for therapeutics. 
Diagnostic assays for HUMAP include methods which utilize the antibody and a label to detect 
HUMAP in human body fluids or in extracts of cells or tissues. The antibodies may be used with or 
without modification, and may be labeled by covalent or non-covalent attachment of a reporter 
molecule. A wide variety of reporter molecules, several of which are described above, are known in the 

25 art and may be used 

A variety of protocols for measuring HUMAP, including ELISAs, RIAs, and FACS, are 
known in the art and provide a basis for diagnosing altered or abnormal levels of HUMAP expression. 
Normal or standard values for HUMAP expression are established by combining body fluids or cell 
extracts taken from normal mammalian subjects, for example, human subjects, with antibody to 

30 HUMAP under conditions suitable for complex formation. The amount of standard complex formation 
may be quantitated by various methods, such as photometric means. Quantities of HUMAP expressed 
in subject, control, and disease samples from biopsied tissues are compared with the standard values. 
Deviation between standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding HUMAP may be used for 
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diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and 
quantify gene expression in biopsied tissues in which expression of HUM AP may be correlated with 
disease. The diagnostic assay may be used to determine absence, presence, and excess expression of 
HUMAP, and to monitor regulation of HUMAP levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding HUMAP or closely related molecules may be used to 
identify nucleic acid sequences which encode HUMAP. The specificity of the probe, whether it is made 
from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a 
conserved motif, and the stringency of the hybridization or amplification will determine whether the 
probe identifies only naturally occurring sequences encoding HUMAP, allelic variants, or related 
sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of the HUMAP encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO: 18-34 or from 
genomic sequences including promoters, enhancers, and introns of the HUMAP gene. 

Means for producing specific hybridization probes for DNAs encoding HUMAP include the 
cloning of polynucleotide sequences encoding HUMAP or HUMAP derivatives into vectors for the 
production of mRN A probes. Such vectors are known in the art, are commercially available, and may 
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a variety 
of reporter groups, for example, by radionuclides such as 32 P or 35 S, or by enzymatic labels, such as 
alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding HUMAP may be used for the diagnosis of disorders 
associated with expression of HUMAP. Examples of such disorders include, but are not limited to, a 
cell proliferative disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, 
hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall 
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, 
penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; a cell differentiation 
disorder including developmental disorders such as renal tubular acidosis, anemia, Cushing's 
syndrome, achondroplastic dwarfism, Duchenne and Becker muscular dystrophy, epilepsy, gonadal 
dysgenesis, WAGR syndrome (Wilms' tumor, aniridia, genitourinary abnormalities, and mental 
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retardation), Smith-Magenis syndrome, myelodysplasia syndrome, hereditary mucoepitheiial 
dysplasia, hereditary keratodermas, hereditary neuropathies such as Charcot-Marie-Tooth disease and 
neurofibromatosis, hypothyroidism, hydrocephalus, seizure disorders such as Syndenham's chorea 
and cerebral palsy, spina bifida, anencephaly, craniorachischisis, congenital glaucoma, cataract, and 
sensorineural hearing loss and disorders of immune cell activation; and a disorder of cell signaling 
including endocrine disorders such as disorders of the hypothalamus and pituitary resulting from 
lesions such as primary brain tumors, adenomas, infarction associated with pregnancy, 
hypophysectomy, aneurysms, vascular malformations, thrombosis, infections, immunological 
disorders, and complications due to head trauma; disorders associated with hyperpituitarism including 
acromegaly, giantism, and syndrome of inappropriate antidiuretic hormone (ADH) secretion (SIADH) 
often caused by benign adenoma; disorders associated with hypothyroidism including goiter, 
myxedema, acute thyroiditis associated with bacterial infection; disorders associated with 
hyperparathyroidism including Conn disease (chronic hypercalemia); pancreatic disorders such as 
Type I or Type II diabetes mellitus and associated complications; disorders associated with the 
adrenals such as hyperplasia, carcinoma, or adenoma of the adrenal cortex, hypertension associated 
with alkalosis; disorders associated with gonadal steroid hormones such as: in women, abnormal 
prolactin production, infertility, endometriosis, perturbations of the menstrual cycle, polycystic 
ovarian disease, hyperprolactinemia, isolated gonadotropin deficiency, amenorrhea, galactorrhea, 
hermaphroditism, hirsutism and virilization, breast cancer, and, in postmenopausal women, 
osteoporosis; and, in men, Leydig cell deficiency, male climacteric phase, and germinal cell aplasia, 
hypergonadal disorders associated with Leydig cell tumors, androgen resistance associated with 
absence of androgen receptors, syndrome of 5 a -reductase, and gynecomastia. The polynucleotide 
sequences encoding HUMAP may be used in Southern or northern analysis, dot blot, or other 
membrane-based technologies; in PCR technologies; in dipstick, pin, and multiformat ELISA-like 
assays; and in miCToarrays utilizing fluids or tissues from patients to detect altered HUMAP expression. 
Such qualitative or quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding HUMAP may be useful in assays that 
detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding HUMAP may be labeled by standard methods and added to a fluid or tissue sample 
from a patient under conditions suitable for the formation of hybridization complexes. After a suitable 
incubation period, the sample is washed and the signal is quantified and compared with a standard 
value. If the amount of signal in the patient sample is significantly altered in comparison to a control 
sample then the presence of altered levels of nucleotide sequences encoding HUMAP in the sample 
indicates the presence of the associated disorder. Such assays may also be used to evaluate the efficacy 
of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the 
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treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with expression of 
HUMAP, a normal or standard profile for expression is established. This may be accomplished by 
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 
5 sequence, or a fragment thereof, encoding HUMAP, under conditions suitable for hybridization or 

amplification. Standard hybridization may be quantified by comparing the values obtained from normal 
subjects with values from an experiment in which a known amount of a substantially purified 
polynucleotide is used. Standard values obtained in this manner may be compared with values obtained 
from samples from patients who are symptomatic for a disorder. Deviation from standard values is 

10 used to establish the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject. The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 

15 days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the development 
of the disease, or may provide a means for detecting the disease prior to the appearance of actual 
clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ 

20 preventative measures or aggressive treatment earlier thereby preventing the development or further 
progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding HUMAP 
may involve the use of PCR. These oligomers may be chemically synthesized, generated enzymatically, 
or produced in vitro . Oligomers will preferably contain a fragment of a polynucleotide encoding 

25 HUMAP, or a fragment of a polynucleotide complementary to the polynucleotide encoding HUMAP, 
and will be employed under optimized conditions for identification of a specific gene or condition. 
Oligomers may also be employed under less stringent conditions for detection or quantification of 
closely related DNA or RNA sequences. 

Methods which may also be used to quantify the expression of HUMAP include radiolabeling 

30 or biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 

standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et 
al. (1993) Anal. Biochenx 212:229-236.) The speed of quantitation of multiple samples may be 
accelerated by running the assay in a high-throughput format where the oligomer of interest is presented 
in various dilutions and a spectrophotometric or colorimetric response gives rapid quantitation. 
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In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as targets in a microarray. The microarray can 
be used to monitor the expression level of large numbers of genes simultaneously and to identify genetic 
variants, mutations, and polymorphisms. This information may be used to determine gene function, to 
5 understand the genetic basis of a disorder, to diagnose a disorder, and to develop and monitor the 
activities of therapeutic agents. 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g., 
Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. 
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251 1 16; Shalon, D. et al. 

10 (1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150- 
2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) 

In another embodiment of the invention, nucleic acid sequences encoding HUMAP may be used 
to generate hybridization probes useful in mapping the naturally occurring genomic sequence. The 
sequences may be mapped to a particular chromosome, to a specific region of a chromosome, or to 

15 artificial chromosome constructions, e.g., human artificial chromosomes (HACs), yeast artificial 

chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI constructions, or single 
chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355; Price, 
CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet 7:149-154.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical chromosome 

20 mapping techniques and genetic map data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra , pp. 
965-968.) Examples of genetic map data can be found in various scientific journals or at the Online 
Mendelian Inheritance in Man (OMIM) World Wide Web site. Correlation between the location of the 
gene encoding HUMAP on a physical chromosomal map and a specific disorder, or a predisposition to 
a specific disorder, may help define the region of DNA associated with that disorder. The nucleotide 

25 sequences of the invention may be used to detect differences in gene sequences among normal, carrier, 
and affected individuals. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, may 

30 reveal associated markers even if the number or arm of a particular human chromosome is not known. 
New sequences can be assigned to chromosomal arms by physical mapping. This provides valuable 
information to investigators searching for disease genes using positional cloning or other gene discovery 
techniques. Once the disease or syndrome has been crudely localized by genetic linkage to a particular 
genomic region, e.g., ataxia-telangiectasia to 1 lq22-23, any sequences mapping to that area may 
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represent associated or regulatory genes for further investigation. (See, e.g., Gatti, R.A. et al. (1988) 
Nature 336:577-580.) The nucleotide sequence of the subject invention may also be used to detect 
differences in the chromosomal location due to translocation, inversion, etc., among normal, carrier, or 
affected individuals. 

In another embodiment of the invention. HUMAP, its catalytic or immunogenic fragments, or 
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes 
between HUMAP and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 
having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. (1984) PCT 
application WO84/03564.) In this method, large numbers of different small test compounds are 
synthesized on a solid substrate. The test compounds are reacted with HUMAP, or fragments thereof, 
and washed. Bound HUMAP is then detected by methods well known in the art. Purified HUMAP can 
also be coated directly onto plates for use in the aforementioned drug screening techniques. 
Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a 
solid support. 

In another embodiment, one may use competitive drug screening assays in which neutralizing 
antibodies capable of binding HUMAP specifically compete with a test compound for binding 
HUMAP. In this manner, antibodies can be used to detect the presence of any peptide which shares one 
or more antigenic determinants with HUMAP. 

In additional embodiments, the nucleotide sequences which encode HUMAP may be used in 
any molecular biology techniques that have yet to be developed, provided the new techniques rely on 
properties of nucleotide sequences that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent. The following preferred specific 
embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder 
of the disclosure in any way whatsoever. 

The disclosures of all patents, applications, and publications mentioned above and below, in 
particular U.S. Ser. No. 60/130,694 and U.S. Ser. No. 60/140,580, are hereby expressly incorporated 
by reference. 
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EXAMPLES 



I. 



Construction of cDNA Libraries 



RNA was purchased from Clontech or isolated from tissues described in Table 4. Some tissues 
were homogenized and lysed in guanidinium isothiocyanate. while others were homogenized and lysed 
5 in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic 
solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl 
cushions or extracted with chloroform. RNA was precipitated Irom the lysates with either isopropanol 
or sodium acetate and ethanol, or by other routine methods. 



10 purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated 
using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QI AGEN, 
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 
isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). 

15 In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 

libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the 
recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra , units 
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 

20 oligonucleotide adapters were ligated to double stranded cDNA, and the cDN A was digested with the 
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000 
bp) using SEPHACRYL SI 000, SEPH AROSE CL2B, or SEPH AROSE CL4B column 
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs 
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., 

25 PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), pcDNA2. 1 plasmid 
(Invitrogen, Carlsbad CA), or pINCY plasmid (Incyte Pharmaceuticals, Palo Alto CA). Recombinant 
plasmids were transformed into competent E. coli cells including XLl-Blue, XLl-BlueMRF, or SOLR 
from Stratagene or DH5a, DH10B, or ElectroMAX DH10B from Life Technologies. 
II. Isolation of cDNA Clones 

30 Plasmids were recovered from host cells by in vivo excision using the UNIZAP vector system 

(Stratagene) or by cell lysis. Plasmids were purified using at least one of the following: a Magic or 
WIZARD Minipreps DNA purification system (Promega); an AGTC Miniprep purification kit (Edge 
Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL 8 Plus Plasmid, QIAWELL 8 
Ultra Plasmid purification systems or the R.E. A.L. PREP 96 plasmid purification kit from QIAGEN. 



Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
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Following precipitation, plasmids were resuspended in 0. 1 nil of distilled water and stored, with or 
without lyophilization, at 4°C. 

Alternatively, plasniid DNA was amplified from host cell lysates using direct link PCR in a 
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
5 cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 384- 
well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using 
PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence scanner 
(Labsystems Oy, Helsinki, Finland). 
III. Sequencing and Analysis 

10 cDNA sequencing reactions were processed using standard methods or high-throughput 

instrumentation such as the ABI CATALYST 800 (Perkin-Elmer) thermal cycler or the PTC-200 
thermal cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or 
the MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared 
using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as 

15 the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Perkin-Elmer). 

Electrophoretic separation of cDN A sequencing reactions and detection of labeled polynucleotides were 
carried out using the MEG AB ACE 1000 DNA sequencing system (Molecular Dynamics); the ABI 
PRISM 373 or 377 sequencing system (Perkin-Elmer) in conjunction with standard ABI protocols and 
base calling software; or other sequence analysis systems known in the art. Reading frames within the 

20 cDNA sequences were identified using standard methods (reviewed in Ausubel, 1997, supra , unit 7.7). 
Some of the cDNA sequences were selected for extension using the techniques disclosed in Example VI. 

The polynucleotide sequences derived from cDN A sequencing were assembled and analyzed 
using a combination of software programs which utilize algorithms well known to those skilled in the 
art. Table 5 summarizes the tools, programs, and algorithms used and provides applicable descriptions, 

25 references, and threshold parameters. The first column of Table 5 shows the tools, programs, and 
algorithms used, the second column provides brief descriptions thereof, the third column presents 
appropriate references, all of which are incorporated by reference herein in their entirety, and the fourth 
column presents, where applicable, the scores, probability values, and other parameters used to evaluate 
the strength of a match between two sequences (the higher the score, the greater the homology between 

30 two sequences). Sequences were analyzed using MACDNASIS PRO software (Hitachi Software 

Engineering, South San Francisco CA) and LASERGENE software (DNASTAR). Polynucleotide and 
polypeptide sequence alignments were generated using the default parameters specified by the clustal 
algorithm as incorporated into the MEGALIGN multisequence alignment program (DNASTAR), which 
also calculates the percent identity between aligned sequences. 
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The polynucleotide sequences were validated by removing vector, linker, and polyA sequences 
and by masking ambiguous bases, using algorithms and programs based on BLAST, dynamic 
programing, and dinucleotide nearest neighbor analysis. The sequences were then queried against a 
selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, and 
5 eukaryote databases, and BLOCKS, PRINTS, DOMO, PRODOM, and PFAM to acquire annotation 
using programs based on BLAST, FASTA, and BLIMPS. The sequences were assembled into full 
length polynucleotide sequences using programs based on Phred, Phrap, and Consed, and were screened 
for open reading frames using programs based on GeneMark, BLAST, and FASTA. The full length 
polynucleotide sequences were translated to derive the corresponding full length amino acid sequences, 

10 and these full length sequences were subsequently analyzed by querying against databases such as the 
GenBank databases (described above), SwissProt, BLOCKS, PRINTS, DOMO, PRODOM, Prosite, 
and Hidden Markov Model (HMM)-based protein family databases such as PFAM. HMM is a 
probabilistic approach which analyzes consensus primary structures of gene families. (See, e.g., 
Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.) 

15 The programs described above for the assembly and analysis of full length polynucleotide and 

amino acid sequences were also used to identify polynucleotide sequence fragments from SEQ ID 
NO: 18-34, Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 
amplification technologies were described in The Invention section above. 
IV. Northern Analysis 

20 Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene 

and involves the hybridization of a labeled nucleotide sequence to a membrane on which RN As from a 
particular cell type or tissue have been bound. (See, e.g., Sambrook, supra , ch. 7; Ausubel, 1995, 
supra , ch. 4 and 16.) 



25 molecules in nucleotide databases such as GenBank or LIFESEQ (Incyte Pharmaceuticals). This 

analysis is much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the 
computer search can be modified to determine whether any particular match is categorized as exact or 
similar. The basis of the search is the product score, which is defined as: 



The product score takes into account both the degree of similarity between two sequences and the length 
of the sequence match. For example, with a product score of 40, the match will be exact within a 1% to 
2% error, and, with a product score of 70, the match will be exact. Similar molecules are usually 
identified by selecting those which show product scores between 15 and 40, although lower scores may 



Analogous computer techniques applying BLAST were used to search for identical or related 



% sequence identity x % maximum BLAST score 



30 



100 
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identify related molecules. 

The results of northern analyses are reported as a percentage distribution of libraries in which 
the transcript encoding HUMAP occurred. Analysis involved the categorization of cDNA libraries by 
organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic, 
developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous, 
reproductive, and urologic. The disease/condition categories included cancer, inflammation, trauma, 
cell proliferation, neurological, and pooled. For each category, the number of libraries expressing the 
sequence of interest was counted and divided by the total number of libraries across all categories. 
Percentage values of tissue-specific and disease- or condition-specific expression are reported in Table 
3. 

V. Chromosomal Mapping of HUMAP Encoding Polynucleotides 

The cDNA sequences which were used to assemble SEQ ID NO:27-34 were compared with 
sequences from the Incyte EIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith- Waterman algorithm. Sequences from these databases that matched 
SEQ ID NO: 18-34 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 5). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 
Genome Research (WIGR), and Genethon were used to determine if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 
of all sequences of that cluster, including its particular SEQ ID NO:, to that map location. 

The genetic map location of SEQ ID NO:31 is described in The Invention as range, or 
interval, of a human chromosome. The map position of an interval, in centiMorgans, is measured 
relative to the terminus of the chromosome's p-arm. (The centiMorgan (cM) is a unit of measurement 
based on recombination frequencies between chromosomal markers. On average, 1 cM is roughly 
equivalent to 1 megabase (Mb) of DNA in humans, although this can vary widely due to hot and cold 
spots of recombination.) The cM distances are based on genetic markers mapped by Gen&hon which 
provide boundaries for radiation hybrid markers whose sequences were included in each of the 
clusters. 

VL Extension of HUMAP Encoding Polynucleotides 

The full length nucleic acid sequences of SEQ ID NO: 18-34 were produced by extension of an 
appropriate fragment of the full length molecule using oligonucleotide primers designed from this 
fragment. One primer was synthesized to initiate 5 * extension of the known fragment, and the other 
primer, to initiate 3* extension of the known fragment. The initial primers were designed using OLIGO 
4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 nucleotides 
in length, to have a GC content of about 50% or more, and to anneal to the target sequence at 
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temperatures of about 68 °C to about 72 °C. Any stretch of nucleotides which would result in hairpin 
structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one extension 
was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art. PCR 
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2+ , (NH 4 ) 2 S0 4 , 
and P-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme 
(Life Technologies), and Pfu DNA polymerase (Stratagcne), with the following parameters for primer 
pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 
2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68 °C, 5 min; Step 7: storage at 4°C. In the 
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1 : 94°C, 3 min; Step 2: 
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; 
Step 6: 68 °C, 5 min; Step 7: storage at 4°C. 

The concentration of DNA in each well was determined by dispensing 100 pi PICOGREEN 
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
and 0.5 ul of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, 
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 
concentration of DNA. A 5 //I to 10 fA aliquot of the reaction mixture was analyzed by electrophoresis 
on a 1 % agarose mini-gel to determine which reactions were successful in extending the sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 
digested with CviJI cholera vims endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose 
gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones were 
religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham 
Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs, 
and transfected into competent E. coli cells. Transformed cells were selected on antibiotic-containing 
media, individual colonies were picked and cultured overnight at 37 °C in 384-well plates in LB/2x carb 
liquid media. 

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase 
(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following parameters: 
Step 1: 94°C t 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; Step 5: steps 2, 
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3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was quantified by 
PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA recoveries were 
reamplified using the same conditions as described above. Samples were diluted with 20% 
dimethysulfoxide (1 :2, v/v), and sequenced using DYENAMIC energy transfer sequencing primers and 
the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE 
Terminator cycle sequencing ready reaction kit (Perkin-Elmer). 

In like manner, the nucleotide sequences of SEQ ID NO: 1 8-34 are used to obtain 5' regulatory 
sequences using the procedure above, oligonucleotides designed for such extension, and an appropriate 
genomic library. 

VII. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO: 18-34 are employed to screen cDNAs, 
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base 
pairs, is specifically described, essentially the same procedure is used with larger nucleotide fragments. 
Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 software (National 
Biosciences) and labeled by combining 50 pmol of each oligomer, 250 ^Ci of [y- 32 P] adenosine 
triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase (DuPont NEN, Boston 
MA). The labeled oligonucleotides are substantially purified using a SEPH ADEX G-25 superfine size 
exclusion dextran bead column (Amersham Pharmacia Biotech). An aliquot containing 10 7 counts per 
minute of the labeled probe is used in a typical membrane-based hybridization analysis of human 
genomic DNA digested with one of the following endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba I, or 
Pvu II (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
hours at 40 °C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
compared. 

VIII. Microarrays 

A chemical coupling procedure and an ink jet device can be used to synthesize array elements 
on the surface of a substrate. (See, e.g., Baldeschweiler, supra.) An array analogous to a dot or slot 
blot may also be used to arrange and link elements to the surface of a substrate using thermal, UV, 
chemical, or mechanical bonding procedures. A typical array may be produced by hand or using 
available methods and machines and contain any appropriate number of elements. After hybridization, 
nonhybridized probes are removed and a scanner used to determine the levels and patterns of 
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fluorescence. The degree of complementarity and the relative abundance of each probe which 
hybridizes to an element on the microarray may be assessed through analysis of the scanned images. 

Full-length cDNAs, Expressed Sequence Tags (ESTs), or fragments thereof may comprise the 
elements of the microarray. Fragments suitable for hybridization can be selected using software well 
known in the art such as LASERGENE software (DNASTAR). Full-length cDNAs, ESTs, or 
fragments thereof corresponding to one of the nucleotide sequences of the present invention, or selected 
at random from a cDNA library relevant to the present invention, are arranged on an appropriate 
substrate, e.g., a glass slide. The cDNA is fixed to the slide using, e.g., UV cross-linking followed by 
thermal and chemical treatments and subsequent drying. (See, e.g., Schena, M. et al. (1995) Science 
270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645.) Fluorescent probes are prepared and 
used for hybridization to the elements on the substrate. The substrate is analyzed by procedures 
described above. 

IX. Complementary Polynucleotides 

Sequences complementary to the HUMAP-encoding sequences, or any pans thereof, are used 
to detect, decrease, or inhibit expression of naturally occurring HUMAP. Although use of 
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same procedure 
is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are designed 
using OLIGO 4.06 software (National Biosciences) and the coding sequence of HUMAP. To inhibit 
transcription, a complementary oligonucleotide is designed from the most unique 5* sequence and used 
to prevent promoter binding to the coding sequence. To inhibit translation, a complementary 
oligonucleotide is designed to prevent ribosomal binding to the HUMAP-encoding transcript. 
X* Expression of HUMAP 

Expression and purification of HUMAP is achieved using bacterial or virus-based expression 
systems. For expression of HUMAP in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 
transcriptioa Examples of such promoters include, but are not limited to, the trp-lac (tad) hybrid 
promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory 
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express HUMAP upon induction with isopropyl beta-D- 
thiogalaaopyranoside (IPTG). Expression of HUMAP in eukaryotic cells is achieved by infecting 
insect or mammalian cell lines with recombinant Autographica californica nuclear polyhedrosis virus 
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 
replaced with cDNA encoding HUMAP by either homologous recombination or bacterial-mediated 
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong 
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polyhednn promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 
infect Spodoptera fruginerda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 
Injection of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E.K. et 
al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. GeneTher. 
5 7:1937-1945.) 

In most expression systems, HUMAP is synthesized as a fusion protein with, e.g., glutathione 
S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton 
enzyme from Schistosoma iaponicum , enables the purification of fusion proteins on immobilized 

10 glutathione under conditions that maintain protein activity and antigenicity (Amersham Pharmacia 
Biotech). Following purification, the GST moiety can be proteolylically cleaved from HUMAP at 
specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification 
using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6- 
His, a stretch of six consecutive histidine residues, enables purification on metal-chelate resins 

15 (QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, supra , 
ch. 10 and 16). Purified HUMAP obtained by these methods can be used directly in the following 
activity assay. 

XL Demonstration of HUMAP Activity 

HUMAP activity is demonstrated using a generic immunoblotting strategy or through a 
20 HUMAP-specific activity assay as outlined below. As a general approach, cell lines or tissues 

transformed with a vector containing HUMAP coding sequences can be assayed for HUMAP activity 
by immunoblotting. Transformed cells are denatured in SDS in the presence of ji-mercaptoethanol, 
nucleic acids are removed by ethanol precipitation, and proteins are purified by acetone precipitation. 
Pellets are resuspended in 20 mM tris buffer at pH 7.5 and incubated with Protein G-Sepharose pre- 
25 coated with an antibody specific for HUMAP. After washing, the Sepharose beads are boiled in 

electrophoresis sample buffer, and the eluted proteins subjected to SDS-PAGE. Proteins are transferred 
from the SDS-PAGE gel to a membrane for immunoblotting, and the HUMAP activity is assessed by 
visualizing and quantifying bands on the blot using antibody specific for HUMAP as the primary 
antibody and 125 I-labeled IgG specific for the primary antibody as the secondary antibody. 
30 A specific assay for HUMAP activity measures the expression of HUMAP on the cell surface. 

cDNA encoding HUMAP is transfected into a mammalian (non-human) cell line. Cell surface proteins 
are labeled with biotin as described in de la Fuente, M.A.. et al. ((1997) Blood 90:2398-2405 ). 
Immunoprecipitations are performed using HUMAP-specific antibodies, and immunoprecipitated 
samples are analyzed using SDS-PAGE and immunoblotting techniques. The ratio of labeled 
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immunoprecipitant to unlabeled immunoprecipitant is proportional to the amount of HUMAP expressed 

on the cell surface. 

XII. Functional Assays 

HUMAP function is assessed by expressing the sequences encoding HUMAP at 
physiologically elevated levels in mammalian cell culture systems. cDNA is subcloned into a 
mammalian expression vector containing a strong promoter that drives high levels of cDNA expression. 
Vectors of choice include pCMV SPORT plasmid (Life Technologies) and pCR3. 1 plasmid 
(Invitrogen), both of which contain the cytomegalovirus promoter. 5-10 fig of recombinant vector are 
transiently transfected into a human cell line, for example, an endothelial or hematopoietic cell line, 
using either liposome formulations or electroporation. 1-2 ^g of an additional plasmid containing 
sequences encoding a marker protein are co-transfected. Expression of a marker protein provides a 
means to distinguish transfected cells from nontransfected cells and is a reliable predictor of cDNA 
expression from the recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent 
Protein (GFP; Clontech), CD64, or a CD64-GFP fusion protein. How cytometry (FCM), an 
automated, laser optics-based technique, is used to identify transfected cells expressing GFP or CD64- 
GFP and to evaluate the apoptotic state of the cells and other cellular properties. FCM detects and 
quantifies the uptake of fluorescent molecules that diagnose events preceding or coincident with cell 
death. These events include changes in nuclear DNA content as measured by staining of DNA with 
propidium iodide; changes in cell size and granularity as measured by forward light scatter and 90 
degree side light scatter; down-regulation of DNA synthesis as measured by decrease in 
bromodeoxyuridine uptake; alterations in expression of cell surface and intracellular proteins as 
measured by reactivity with specific antibodies; and alterations in plasma membrane composition as 
measured by the binding of fluorescein-conjugated Annexin V protein to the cell surface. Methods in 
flow cytometry are discussed in Ormerod, M.G. (1994) Flow Cytometry . Oxford, New York NY. 

The influence of HUMAP on gene expression can be assessed using highly purified 
populations of cells transfected with sequences encoding HUMAP and either CD64 or CD64-GFP. 
CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions of 
human immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells 
using magnetic beads coated with either human IgG or antibody against CD 64 (DYNAL, Lake Success 
NY). mRNA can be purified from the cells using methods well known by those of skill in the art. 
Expression of mRNA encoding HUMAP and other genes of interest can be analyzed by northern 
analysis or microarray techniques. 
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XIII. Production of HUMAP Specific Antibodies 

HUMAP substantially purified using polyacryl amide gel electrophoresis (PAGE: see, e.g., 
Harrington, M.G. (1990) Methods Enzymol. 182:488-495). or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 
5 Alternatively, the HUMAP amino acid sequence is analyzed using LASERGENE software 

(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
selection of appropriate epitopes, such as those near the C -terminus or in hydrophilic regions are well 
described in the art. (See, e.g., Ausubel, 1995, supra , ch. 11.) 

10 Typically, oligopeptides of about 15 residues in length are synthesized using an AB1 431A 

peptide synthesizer (Perkin-Elmer) using fmcc-chernistry and coupled to KLH (Sigma- Aldrich, St. 
Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccirnmide ester (MBS) to increase 
immunogenicity. (See, e.g., Ausubel, 1995, supra .) Rabbits are immunized with the oligopeptide- KLH 
complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and anti-HUMAP 

15 activity by, for example, binding the peptide or HUMAP to a substrate, blocking with 1 % BSA, 
reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. 

XIV. Purification of Naturally Occurring HUMAP Using Specific Antibodies 
Naturally occurring or recombinant HUMAP is substantially purified by irnmunoaffinity 

chromatography using antibodies specific for HUMAP. An irnmunoaffinity column is constructed by 
20 covalently coupling anti-HUMAP antibody to an activated chromatographic resin, such as 

CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 

blocked and washed according to the manufacturer's instructions. 

Media containing HUMAP are passed over the irnmunoaffinity column, and the column is 

washed under conditions that allow the preferential absorbance of HUMAP (e.g., high ionic strength 
25 buffers in the presence of detergent). The column is eluted under conditions that disrupt 

antibody/HUMAP binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such 

as urea or thiocyanate ion), and HUMAP is collected. 

XV. Identification of Molecules Which Interact with HUMAP 

HUMAP, or biologically active fragments thereof, are labeled with 125 I Bolton-Hunter reagent. 
30 (See, e.g., Bolton A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules 
previously arrayed in the wells of a multi-well plate are incubated with the labeled HUMAP, washed, 
and any wells with labeled HUMAP complex are assayed. Data obtained using different concentrations 
of HUMAP are used to calculate values for the number, affinity, and association of HUMAP with the 
candidate molecules. 
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Alternatively, molecules interacting with HUMAP are analyzed using the veasi two-hybrid 
system as described in Fields, S. and O. Song (1989, Nature 340:245-246), or using commercially 
available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech). 

Various modifications and variations of the described methods and systems of the invention 
will be apparent to those skilled in the art without departing from the scope and spirit of the invention. 
Although the invention has been described in connection with certain embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the invention which arc obvious 
to those skilled in molecular biology or related fields are intended to be within the scope of the following 
claims. 
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What is claimed is: 

1. An isolated polypeptide comprising an amino acid sequence selected from the croup 
consisting of: 

a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1-1 7, 

b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-17, 

c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-17, and 

d) an immunogenic fragment of an amino acid sequence selected from the group consisting 
of SEQ ID NO:l-17. 

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO:l- 

17. 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide of claim 3 selected from the group consisting of SEQ ID 
NO: 18-34. 

5. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

6. A cell transformed with a recombinant polynucleotide of claim 5. 

7. A transgenic organism comprising a recombinant polynucleotide of claim 5. 

8. A method for producing a polypeptide of claim 1, the method comprising: 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said 
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide 
comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of claim 
1, and 

b) recovering the polypeptide so expressed. 

9. An isolated antibody which specifically binds to a polypeptide of claim 1. 
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10. An isolated polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of: 

a) a polynucleotide sequence selected from the group consisting of SEQ ID NO: 18-34, 

b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a 
5 polynucleotide sequence selected from the group consisting of SEQ ID NO: 18-34, 

c) a polynucleotide sequence complementary to a), 

d) a polynucleotide sequence complementary to b), and 

e) an RN A equivalent of a)-d). 

10 1 1. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 

polynucleotide of claim 10. 

12. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 10, the method comprising: 

15 a) hybridizing the sample with a probe comprising at least 16 contiguous nucleotides 

comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization 
complex is formed between said probe and said target polynucleotide, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 

20 present, the amount thereof. 

13. A method of claim 12, wherein the probe comprises at least 30 contiguous nucleotides. 

14. A method of claim 12, wherein the probe comprises at least 60 contiguous nucleotides. 

25 

15. A pharmaceutical composition comprising an effective amount of a polypeptide of claim 
1 and a pharmaceutically acceptable excipient. 

16. A method for treating a disease or condition associated with decreased expression of 
30 functional HUMAP, comprising administering to a patient in need of such treatment the 

pharmaceutical composition of claim 15. 

17. A method for screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1, the method comprising: 

35 a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 
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b) detecting agonist activity in the sample. 

18. A pharmaceutical composition comprising an agonist compound identified by a method 
of claim 17 and a pharmaceutically acceptable excipient. 

19. A method for treating a disease or condition associated with decreased expression of 
functional HUMAP, comprising administering to a patient in need of such treatment a pharmaceutical 
composition of claim 18. 

20. A method for screening a compound for effectiveness as an antagonist of a polypeptide 
of claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 

21. A pharmaceutical composition comprising an antagonist compound identified by a 
method of claim 20 and a pharmaceutically acceptable excipient. 

22. A method for treating a disease or condition associated with overexpression of functional 
HUMAP, comprising administering to a patient in need of such treatment a pharmaceutical 
composition of claim 21. 

23. A method for screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 4, the method 
comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, and 

b) detecting altered expression of the target polynucleotide. 
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60 


Lys 


Glu 


Val 


Arg 


Trp 


Leu 


Arg 


Ser 


Pro 


Phe 


Pro 


Gin 


Arg 


Ser 


Gin 










65 










70 








75 


Ala 


Val 


His 


He 


Phe 


Arg 


Asp 


Gly 


Lys 


Asp 


Gin 


Asp 


Glu 


Asp 


Leu 










80 










85 








90 


Met 


Pro 


Glu 


Tyr 


Lys 


Gly Arg 


Thr 


Val 


Leu 


Val 


Arg 


Asp 


Ala 


Gin 










95 










100 










105 


Glu 


Gly 


Ser 


Val 


Thr 


Leu 


Gin 


He 


Leu 


Asp 


Val 


Arg 


Leu 


Glu 


Asp 










110 










115 










120 


Gin 


Gly 


Ser 


Tyr 


Arg 


Cys 


Leu 


He 


Gin 


Val 


Gly 


Asn 


Leu 


Ser 


Lys 










125 










130 










135 


Glu 


Asp 


Thr 


Val 


He 


Leu 


Gin 


Val 


Ala 


Ala 


Pro 


Ser 


Val 


Gly 


Ser 










140 










145 








150 


Leu 


Ser 


Pro 


Ser 


Ala 


Val 


Ala 


Leu 


Ala 


Val 


He 


Leu 


Pro 


Val 


Leu 










155 










160 










165 


Val 


Leu 


Leu 


He 


Met 


Val 


cys 


Leu 


Cys 


Leu 


He 


Trp 


Lys 


Gin 


Arg 










170 










175 










180 


Arg 


Ala 


Lys 


Glu 


Lys 


Leu 


Leu 


Tyr 


Glu 


His 


Val 


Thr 


Glu 


Val 


Asp 










185 










190 










195 


Asn 


Leu 


Leu 


Ser 


Asp 


His 


Ala 


Lys 


Glu 


Lys 


Gly 


Lys 


Leu 


His 


Lys 










200 










205 










210 


Ala 


Val 


Lys 


Lys 


Leu 


Arg 


Ser 


Glu 


Leu 


Lys 


Leu 


Lys 


Arg 


Ala 


Ala 










215 










220 










225 


Ala 


Asn 


Ser 


Gly 


Trp 


Arg 


Arg 


Ala 


Arg 


Leu 


His 


Phe 


Val 


Ala 


Val 
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230 










235 










240 


Thr 

A All. 


Leu 


Asp 


Pro 


Asp 


Thr 


Ala 


His 


Pro 


Lys 


Leu 


He 


Leu 


Ser 


Glu 








245 










250 










255 


Asp 


Gin 


Arg 


Cvs 


Val 


Arg 


Leu 


Gly Asp 


Arg 


Arg 


Gin 


Pro 


val 


Pro 






260 










265 










270 


Asp 


Asn 


Pro 


Gin 


Arg 


Phe 


Asp 


Phe 


Val 


Val 


Ser 


He 


Leu 


Gly 


Ser 








275 










280 










285 


Glu 


Tvr 

J; J " 


Phe 


Thr 


Thr 


Gly 


Cys 


His 


Tyr 


Trp 


Glu 


Val 


Tyr 


Val 


Gly 








290 










295 










300 


Asp 


Lys 


Thr 


Lys 


Trp 


He 


Leu 


Gly Val 


Cys 


Ser 


Glu 


Ser 


Val 


Ser 








305 










310 










315 


Arg 


Lys 


Glv 

w Jr 


Lvs 


Val 
320 


Thr 


Ala 


Ser 


Pro 


Ala 
325 


Asn 


Gly 


His 


Trp 


Leu 

330 


Leu 


Arg 


Gin 


Ser 


Arg 
335 


Gly 


Asn 




TVr 
iy l 


Glu 
340 


Ala 


Leu 


Thr 


Ser 


Pro 
345 


Gin 


Thr 


Ser 


Phe 


Arg 
350 


Leu 


Lys 


Glu 


Pro 


Pro 
355 


Arg 


Cvs 


Val 


Gly 


He 
360 


Phe 


Leu 


Ast) 


Tvr 


Glu 
365 


Ala 


Gly 


Val 


He 


Ser 
370 


Phe 


Tvr 


Asn 


Val 


Thr 
375 


Asn 


Lys 


Ser 


His 


lie 
380 


Phe 


Thr 


Phe 


Thr 


His 
385 


Asn 


Phe 


Ser 


Glv 


Pro 
390 


T .oil 

J — J — u 




Pro 


Phe 


Phe 
395 


Glu 


Pro 


Cys 


Leu 


His 
400 


Asp 


Glv 


Glv 


Lys 


Asn 
405 


X 111. 




ri u 




Val 
410 


He 


Cvs 


Ser 


Glu 


Leu 
415 


His 


Lvs 


w jL 


Glu 


Glu 
420 


Ser 


He 


Val 


Pro 


Arg 
425 


Pro 


Glu 


Gly 


Lys 


Gly 
430 


His 


Ala 


Asn 


Gly 


Asp 
435 


Val 


Ser 


Leu 


Lys 


Val 


Asn 


Ser 


Ser 


Leu 


Leu 


Pro 


Pro 


Lys 


Ala 


Pro 








440 










445 










450 


Glu 


Leu 


Lys 


Asp 


He 


He 


Leu 


Ser 


Leu 


Pro 


Pro 


Asp 


Leu 


Gly 


Pro 








455 










460 










465 


Ala 


Leu 


Gin 


Glu 


Leu 
470 


Lys 


Ala 


Pro 


Ser 


Phe 
475 
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Met 


Ala 


Trp 


Asn 


Thr 


Asn 


Leu 


Arg 


Trp 


Arg 


Leu 


Pro 


Leu 


Thr 


Cys 


1 








5 










10 










15 


Leu 


Leu 


Leu 


Gin 


Val 


He 


Met 


Val 


He 


Leu 


Phe 


Gly 


Val 


Phe 


Val 










20 










25 










30 


Arg 


Tyr 


Asp 


Phe 


Glu 


Ala 


Asp 


Ala 


His 


Trp 


Trp 


Ser 


Glu 


Arg 


Thr 










35 










40 










45 


His 


Lys 


Asn 


Leu 


Ser 


Asp 


Met 


Glu 


Asn 


Glu 


Phe 


Tyr 


Tyr 


Arg 


Tyr 










50 










55 










60 


Pro 


Ser 


Phe 


Gin 


Asp 


Val 


His 


Val 


Met 


Val 


Phe 


Val 


Gly 


Phe 


Gly 










65 










70 










75 


Phe 


Leu 


Met 


Thr 


Phe 


Leu 


Gin 


Arg 


Tyr 


Gly 


Phe 


Ser 


Ala 


Val 


Gly 










80 










85 










90 


Phe 


Asn 


Phe 


Leu 


Leu 


Ala 


Ala 


Phe 


Gly 


He 


Gin 


Trp 


Ala 


Leu 


Leu 










95 










100 










105 


Met 


Gin 


Gly 


Trp 


Phe 


His 


Phe 


Leu 


Gin 


Asp 


Arg 


Tyr 


He 


Val 


Val 










110 










115 










120 


Gly 


Val 


Glu 


Asn 


Leu 


He 


Asn Ala Asp 


Phe 


Cys 


Val 


Ala 


Ser 


Val 








125 










130 










135 


Cys 


Val 


Ala 


Phe 


Gly 


Ala 


Val 


Leu 


Gly 


Lys 


Val 


Ser 


Pro 


He 


Gin 










140 










145 










150 


Leu 


Leu 


He 


Met 


Thr 


Phe 


Phe 


Gin 


Val 


Thr 


Leu 


Phe 


Ala 


Val 


Asn 
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155 










160 


Glu 


Phe 


He 


Leu 


Leu 


Asn 


Leu 


'Leu 


Lys 


Val 










170 










175 


Ser 


Met 


Thr 


He 


His 


Thr 


Phe 


Gly 


Ala 


Tyr 










185 










190 


Thr 


Arg 


He 


Leu 


Tyr 


Arg 


Arg 


Asn 


Leu 


GlU 










200 










205 


Gin 


Asn 


Ser 


Val 


Tvr 


Gin 


Ser 


Asp 


Leu 


Phe 










215 










220 


Leu 


Phe 


Leu 


Trp 


Met 


Tyr 


Trp 


Pro 


Ser 


Phe 










230 










235 


Tvr 


His 


Gly Asp 


Ser 


Gin 


His 


Arg 


Ala 


Ala 










245 










250 


Ser 


Leu 


Ala 


Ala 


Cys 


Val 


Leu 


Thr 


Ser 


Val 










260 










265 


Leu 


His 


Lys 


Lys 


Gly 


Lys 


Leu 


Asp 


Met 


Val 










275 










280 


Thr 


Leu 


Ala 


Gly 


Gly 


Val 


Ala 


Val 


Gly 


Thr 










290 










295 


Leu 


Met 


Pro 


Tyr 


Gly 


Ala 


Leu 


He 


He 


Gly 










305 










310 


Tie 


Ser 


Thr 


Leu 


GlV 


Phe 


Val 


Tyr 


Leu 


Thr 










320 










325 


■A ^ -A- y 


Leu 


His 


He 


Gin 


Asp 


Thr 


Cys 


Gly 


He 








335 










340 


He 

_1_ -L w 


Pro 


Gly 


He 


He 


Gly 


Gly 


He 


Val 


Gly 










350 










355 


Ser 


Ala 


Ser 


Leu 


Glu 


Val 


Tyr 


Gly 


Lys 


Glu 










365 










370 


Phe 


Asp 


Phe 


Gin 


Gly 


Phe 


Asn 


Gly Asp 


Trp 








380 










385 


GlV 


Lys 


Phe 


Gin 


He 


Tyr 


Gly 


Leu 


Leu 


Val 










395 










400 




Met 


Gly Gly 


He 

-i- -i- 


He 


Val 


Gly 


Leu 


He 










410 










415 


Trp 


Gly 


Gin 


Pro 


Ser 


Asp 


GlU 


Asn 


Cys 


Phe 










425 










430 


Trp 


Glu 


Met 


Pro 


Glu 


Gly 


Asn 


Ser 


Thr 


Val 








440 










445 


Pro 


Thr 


Phe 


Lys 


Pro 


Ser 


Gly 


Pro 


Ser 


Val 










455 










460 


Val 


Ser 


Pro 


Leu 


Pro 


Met 


Ala 


Ser 


Ser 


Val 










470 










475 
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Met 
1 


Ala 


Gly 


Asp 


Gly 
5 


Arg 


Arg 


Ala 


Glu 


Ala 
10 


Gly Val 


Tyr 


Val 


Thr 

20 


Pro 


Arg 


Ala 


Pro 


He 
25 


Arg 


Leu 


Ala 


Pro 


Gin 
35 


Asn 


Gly 


Gly 


Cys 


Ser 
40 


Arg 


Thr 


Pro 


Pro 


Ser 
50 


His 


Gin 


Gly 


Arg 


Arg 
55 


Asp 


GlU 


Pro 


Pro 


Glu 
65 


Val 


Tyr 


Gly Asp 


Phe 
70 


Lys 


Glu 


Arg 


Ser 


Pro 
80 


Val 


Gly 


Lys 


Arg 


Thr 
85 
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165 

Lys Asp Ala Gly Gly 

180 

Phe Gly Leu Thr Val 

195 

Gin Ser Lys Glu Arg 

210 

Ala Met He Gly Thr 

225 

Asn Ser Ala He Ser 

240 

He Asn Thr Tyr Cys 

255 

Ala He Ser Ser Ala 

270 

His He Gin Asn Ala 

285 

Ala Ala Glu Met Met 

300 

Phe Val Cys Gly He 

315 

Pro Phe Leu Glu Ser 

330 

Asn Asn Leu His Gly 

345 

Ala Val Thr Ala Ala 

360 

Gly Leu Val His Ser 

375 

Thr Ala Arg Thr Gin 

390 

Thr Leu Ala Met Ala 

405 

Leu Arg Leu Pro Phe 

420 

Glu Asp Ala Val Tyr 

435 

Tyr He Pro Glu Asp 

450 

Pro Ser Val Pro Met 

465 

Pro Leu Val Pro 



Val Arg Glu Gly Trp 

15 

Arg Glu Gly Arg Gly 

30 

Asp Ala Pro Ala Tyr 

45 

Glu Val Arg Phe Ser 

60 

Glu Pro Leu Val Ala 

75 

Arg Leu Glu Glu Phe 

90 
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Arg 


Ser 


Asp 


Ser 


Ala 
95 


Lys 


Glu 


Glu 


Val 


Arg 
100 


Glu 


Ser 


Ala 


Tyr 


Tyr 
10b 


Leu 


Arg 


Ser 


Arg 


Gin 
110 


Arg 


Arg 


Gin 


Pro 


Arg 
115 


Pro 


Gin 


Glu 


Thr 


Glu 
120 


Glu 


Met 


Lys 


Thr 


Arg 
125 


Arg 


Thr 


Thr 


Arg 


Leu 
130 


Gin 


Gin 


Gin 


His 


Ser 
135 


Glu 


Gin 


Pro 


Pro 


Leu 
140 


Gin 


Pro 


Ser 


Pro 


Val 
145 


Met 


Thr 


Arg 


Arg 


Gly 
150 


Leu 


Arg 


Asp 


Ser 


His 
155 


Ser 


Ser 


Glu 


Glu 


Asp 
160 


Glu 


Ala 


Ser 


Ser 


Gin 
165 


Thr 


Asp 


Leu 


Ser 


Gin 
170 


Thr 


lie 


Ser 


Lys 


Lys 
175 


Thr 


Val 


Arg 


Ser 


He 
180 


Gin 


Glu 


Ala 


Pro 


Ala 
185 


Val 


Ser 


Glu 


Asp 


Leu 
190 


Val 


He 


Arg 


Leu 


Arg 
195 


Arg 


Pro 


Pro 


Leu 


Arg 
200 


Tyr 


Pro 


Arg 


Tyr 


Glu 
205 


Ala 


Thr 


Ser 


Val 


Gin 
210 


Gin 


Lys 


Val 


Asn 


Phe 
215 


Ser 


Glu 


Glu 


Gly 


Glu 
220 


Thr 


Glu 


Glu 


Asp 


Asp 
225 


Gin 


Asp 


Ser 


Ser 


His 
230 


Ser 


Ser 


Val 


Thr 


Thr 
235 


Val 


Lys 


Ala 


Arg 


Ser 
240 


Arg 


Asp 


Ser 


Asp 


Glu 
245 


Ser 


Gly 


Asp 


Lys 


Thr 
250 


Thr 


Arg 


Ser 


Ser 


Ser 
255 


Gin 


Tyr 


lie 


Glu 


Ser 
260 


Phe 


Trp 


Gin 


Ser 


Ser 
265 


Gin 


Ser 


Gin 


Asn 


Phe 
270 


Thr 


Ala 


His 


Asp 


Lys 
275 


Gin 


Pro 


Ser 


Val 


Leu 
280 


Ser 


Ser 


Gly 


Tyr 


Gin 

285 


Lys 


Thr 


Pro 


Gin 


Glu 
290 


Trp 


Ala 


Pro 


Gin 


Thr 
295 


Ala 


Arg 


He 


Arg 


Thr 
300 


Arg 


Met 


Gin 


Thr 


Ser 
305 


Ser 


Pro 


Gly 


Lys 


Ser 
310 


Ser 


He 


Tyr 


Gly 


Ser 
315 


Phe 


Ser 


Asp 


Asp 


Asp 


Ser 


lie 


Leu 


Lys 


Ser 


Glu 


Leu 


Gly Asn 


Gin 










320 










325 










330 


Ser 


Pro 


Ser 


Thr 


Ser 


Ser 


Arg 


Gin 


Val 


Thr 


Gly Gin 


Pro 


Gin 


Asn 










335 










340 










345 


Ala 


Ser 


Phe 


Val 


Lys 
350 


Arg 


Asn 


Arg 


Trp 


Trp 
355 


Leu 


Leu 


Pro 


Leu 


He 
360 


Ala 


Ala 


Leu 


Ala 


Ser 
365 


Gly 


Ser 


Phe 


Trp 


Phe 
370 


Phe 


Ser 


Thr 


Pro 


Glu 
375 


Val 


Glu 


Thr 


Thr 


Ala 
380 


Val 


Gin 


Glu 


Phe 


Gin 
385 


Asn 


Gin 


Met 


Asn 


Gin 
390 


Leu 


Lys 


Asn 


Lys 


Tyr 
395 


Gin 


Gly 


Gin 


Asp 


Glu 
400 


Lys 


Leu 


Trp 


Lys 


Arg 
405 


Ser 


Gin 


Thr 


Phe 


Leu 
410 


Glu 


Lys 


His 


Leu 


Asn 
415 


Ser 


Ser 


His 


Pro 


Arg 
420 


Ser 


Gin 


Pro 


Ala 


lie 
425 


Leu 


Leu 


Leu 


Thr 


Ala 
430 


Ala 


Arg 


Asp 


Ala 


Glu 
435 


Glu 


Ala 


Leu 


Arg 


Cys 
440 


Leu 


Ser 


Glu 


Gin 


lie 
445 


Ala 


Asp 


Ala 


Tyr 


Ser 
450 


Ser 


Phe 


Arg 


Ser 


Val 
455 


Arg 


Ala 


lie 


Arg 


lie 
460 


Asp 


Gly 


Thr 


Asp 


Lys 
465 


Ala 


Thr 


Gin Asp 


Ser 


Asp 


Thr 


Val 


Lys 


Leu 


Glu 


Val 


Asp 


Gin 


Glu 










470 










475 










480 


Leu 


Ser 


Asn 


Gly 


Phe 
485 


Lys 


Asn 


Gly 


Gin 


Asn 
490 


Ala 


Ala 


Val 


Val 


His 
4 95 


Arg 


Phe 


Glu 


Ser 


Phe 
500 


Pro 


Ala 


Gly 


Ser 


Thr 
505 


Leu 


He 


Phe 


Tyr 


Lys 
510 


Tyr 


Cys 


Asp 


His 


Glu 
bib 


Asn 


Ala 


Ala 


Phe 


Lys 
c: o n 


Asp 


Val 


Ala 


Leu 


Val 


Leu 


Thr 


Val 


Leu 


Leu 
530 


Glu 


Glu 


Glu 


Thr 


Leu 
535 


Gly 


Thr 


Ser 


Leu 


Gly 
540 


Leu 


Lys 


Glu 


Val 


Glu 
545 


Glu 


Lys 


Val 


Arg 


Asp 
550 


Phe 


Leu 


Lys 


Val 


Lys 
555 


Phe 


Thr 


Asn 


Ser 


Asn 


Thr 


Pro 


Asn 


Ser 


Tyr 


Asn 


His 


Met 


Asp 


Pro 










560 










565 








570 


Asp 


Lys 


Leu 


Asn 


Gly 
575 


Leu 


Trp 


Ser 


Arg 


He 
580 


Ser 


His 


Leu 


Val 


Leu 
585 


Pro 


Val 


Gin 


Pro 


Glu 


Asn 


Ala 


Leu 


Lys 


Arg 


Gly 


He 


Cys 


Leu 
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Met Asp 


Leu Leu 


Trp 


lie 


Leu 


Pro 


Ser 


Leu 


Trp 


Leu 


Leu 


Leu 


Leu 


1 




5 










10 










15 


Gly Gly 


Pro Ala 


Cys 


Leu 


Lvs 


Thr 


Gin 


Glu 


His 


Pro 


Ser 


Cys 


Pro 






20 










25 










30 


Gly Pro 


Arg Glu 


Leu 


Glu 


Ala 


Ser 


Lys 


Val 


Val 


Leu 


Leu 


Pro 


Ser 






35 










40 










45 


Cys Pro 


Gly Ala 


Pro 


Gly 


Ser 


Pro 


Gly 


GlU 


Lys 


Gly 


Ala 


Pro 


Gly 






50 










55 










60 


Pro Gin 


Gly Pro 


Pro 


Gly 


Pro 


Pro 


Gly 


Lys 


Met 


Gly 


Pro 


Lys 


Gly 






65 










70 










75 


Glu Pro 


Gly Asp 


Pro 


Val 


Asn 


Leu 


Leu 


Arg 


Cys 


Gin 


Glu 


Gly 


Pro 






80 










85 










90 


Arg Asn 


cys Arg 


Glu 


Leu 


Leu 


Ser 


Gin 


Gly 


Ala 


Thr 


Leu 


Ser 


Gly 






95 










100 










105 


Trp Tyr 


His Leu 


Cys 


Leu 


Pro 


Glu 


Gly 


Arg 


Ala 


Leu 


Pro 


Val 


Phe 






110 










115 










120 


Cys Asp 


Met Asp 


Thr 


Glu 


Gly 


Gly Gly 


Trp 


Leu 


Val 


Phe 


Gin 


Arg 






125 










130 










135 


Arg Gin 


Asp Gly 


Ser 


Val 


Asp 


Phe 


Phe 


Arg 


Ser 


Trp 


Ser 


Ser 


Tyr 






14 U 










145 










150 


Arg Ala 


Gly Phe 


Gly 


Asn 


Gin 


Glu 


Ser 


Glu 


Phe 


Trp 


Leu 


Gly 


Asn 






155 










160 










165 


Glu Asn 


Leu His 


Gin 


Leu 


Thr 


Leu 


Gin 


Gly 


Asn 


Trp 


Glu 


Leu 


Arg 






170 










175 










180 


Val Glu 


Leu Glu 


Asp 


Phe 


Asn 


Gly 


Asn 


Arg 


Thr 


Phe 


Ala 


His 


Tyr 






185 










190 
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Ala Thr 


Phe Arg 


Leu 


Leu Gly 


Glu 


Val 


Asp 


His 


Tyr 


Gin 


Leu 


Ala 
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205 










210 


Leu Gly 


Lys Phe 


Ser 


Glu 


Gly 


Thr 


Ala 


Gly 


Asp 


Ser 


Leu 


Ser 


Leu 






215 










220 










225 


His Ser 


Gly Arg 


Pro 


Phe 


Thr 


Thr 


Tyr 


Asp 


Ala 


Asp 


His 


Asp 


Ser 






230 










235 
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Ser Asn 


Ser Asn 


Cys 


Ala 


Val 


lie 


Val 


His 


Gly 


Ala 


Trp 


Trp 


Tyr 






245 










250 










255 


Ala Ser 


cys Tyr 


Arg 


Ser 


Asn 


Leu 


Asn 


Giy 


Arg 


Tyr 


Ala 


Val 


Ser 






260 










265 










270 


Glu Ala 


Ala Ala 


His 


Lys 


Tyr 


Gly 


He 


Asp 


Trp 


Ala 


Ser 


Gly 


Arg 
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280 










285 


Gly Val 


Gly His 


Pro 


Tyr 


Arg 
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Val 


Arg 


Met 


Met 


Leu 


Arg 
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Leu 


Lys 


Asp 


Asp 


Leu 


Gin 


Gly Ala 


Gin 
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He 


GlU 
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Glu 
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Leu 


Arg 
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Glu 
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He 


Glu 


Ala 
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40 
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Leu 


Ala 


Arg 
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Lys 


Gin 


Lys 


Cys 


Phe 


GlU 


Leu 


Gin 


Ala 


Leu 










50 










55 










60 


Leu 


Glu 


Glu 


Glu 


Arg 


Lys 


Ala 


Tyr 


Arg 


Asn 


Gin 


Val 


Glu 


Glu 


Ser 










65 










70 










75 


Thr 


Lys 


Gin 


He 


Gin 


Val 


Leu 


Gin 


Ala 


Gin 


Leu 


Gin 


Arq 


Leu 


His 










80 










85 










90 


lie 


Asp 


Thr 


Glu 


Asn 


Leu 


Arg 


Glu 


Glu 


Lys 


Asp 


Ser 


GlU 


He 


Thr 










95 










100 










105 


Ser 


Thr 


Arg 


Asp 


Glu 


Leu 


Leu 


Ser 


Ala 


Arg 


Asp 


Glu 


He 


Leu 


Leu 










110 










115 










120 


Leu 


His 


Gin 


Ala 


Ala 


Ala 


Lys 


Val 


Ala 


Ser 


GlU 


Arcr 


Asp 


Thr 


Asp 










125 










130 










135 


lie 


Ala 


Ser 


Leu 


Gin 


GlU 


Glu 


Leu 


Lys 


Lvs 


Val 


Arcr 


Ala 


Glu 


Leu 










140 










145 










150 


Glu 


Arcr 


Trp 


Arg 


Lys 


Ala 


Ala 


Ser 


Glu 


Tvr 


Glu 


Lys 


Glu 


He 


Thr 










155 










160 










165 


Ser 


Leu 


Gin 


Asn 


Ser 


Phe 


Gin 


Leu 


Arcr 


Cys 


Gin 


Gin 


Cys 


Glu 


Asr> 










170 










175 










180 


Gin 


Gin 


Arg 


Glu 


GlU 


Ala 


Thr 


Arg 


Leu 


Gin 


Glv 


GlU 


Leu 


Glu 


X— j y ^> 










185 










190 










195 


Leu 


Arg 


Lvs 


Glu 


Trp 


Asn 


Ala 


Leu 


Glu 


Thr 


GlU 


Cys 


His 


Ser 


Leu 










200 










205 










210 


Lvs 


Arg 


Glu 


Asn 


Val 


Leu 


Leu 


Ser 


Ser 


Glu 


Leu 


Gin 


Arg 


Gin 


Glu 










215 










220 








225 


Lvs 


Glu 


Leu 


His 


Asn 


Ser 


Gin 


Lys 


Gin 


Ser 


Leu 


Glu 


Leu 


Thr 












230 










235 










240 


Asp 


Leu 


Ser 


He 


Leu 


Gin 


Met 


Ser 


Arcr 


Lvs 


Glu 


Leu 


Glu 


Asn 


Gin 










245 










250 










255 


val 


Gly 


Ser 


Leu 


Lys 


Glu 


Gin 


His 


Leu 


Arg 


Asp 


Ser 


Ala 


Asp 


Leu 










260 










265 








270 


Lvs 
-i 


Thr 


Leu 


Leu 


Ser 


Lys 


Ala 


Glu 


Asn 


Gin 


Ala 


Lvs 


Asp 


Val 


Gin 










275 










280 










285 


Lys 


Glu 


Tyr 


GlU 


Lys 


Thr 


Gin 


Thr 


Val 


Leu 


Ser 


Glu 


Leu 


Lvs 


Leu 










290 










295 










3 00 

J U Lf 


Lys 


Phe 


Glu 


Met 


Thr 


Glu 


Gin 


Glu 


Lys 


Gin 


Ser 


He 


Thr 


Asp 


Glu 










305 










310 








315 


Leu 


Lys 


Gin 


Cys 


Lys 


Asn 


Asn 


Leu 


Lys 


Leu 


Leu 


Arg 


Glu 


Lys 


Gly 










320 










325 










330 


Asn 


Asn 


Pro 


Ser 


He 


Leu 


Gin 


Pro 


Val 


Pro 


Ala 


Val 


Phe 


He 


Gly 










335 










340 










345 


Leu 


Phe 


Leu 


Ala 


Phe 


Leu 


Phe 


Trp 


Cys 


Phe 


Gly 


Pro 


Leu 


Trp 












350 










355 













<210> 17 
<211> 160 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 4628933CD1 

<400> 17 



Met 


He 


Asn 


Pro 


Glu 


Leu 


Arg 


Asp 


Gly 


Arg 


Ala 


Asp 


Gly 


Phe 


He 


1 








5 










10 










15 


His 


Arg 


He 


Val 


Pro 
20 


Lys 


Leu 


He 


Gin 


Asn 
25 


Trp 


Lys 


He 


Gly 


Leu 
30 


Met 


Cys 


Phe 


Leu 


Ser 


He 


He 


He 


Thr 


Thr 


Val 


Cys 


He 


He 


Met 










35 










40 








45 


He 


Ala 


Thr 


Trp 


Ser 
50 


Lys 


His 


Ala 


Lys 


Pro 
55 


Val 


Ala 


Cys 


Ser 


Gly 
60 
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Asp 


Trp 


Leu 


Gly 


Val 
65 


Arq 


Asp 


Thr 


Ara 


Asn 


Trp 


Thr 
80 


Ala 


Ser 


Ala 


Glu 


Leu 


Ala 


Gin 

95 


lie 


Asp 


Lys 


Arg 


Tyr 


Ala 


Gly 
110 


Thr 


Asp 


Lys 


Gin 


Gly 


Asp 


Ser 
125 


Trp 


Lys 


Gly 


Trp 


Pro 


Ser 


Asn 
140 


Ser 


Lys 


Gin 


Trp 


Leu 


Leu 


Leu 
155 


Leu 


Gly 



Lys 


Cys 


Phe 


Tyr 


Phe 


Ser 


Asp 


Asp 






70 










75 


Lys 


He 


Phe 


Cys 


Ser 


Leu 


Gin 


Lys 






85 










90 


Thr 


Gin 


Glu 


Asp 


Met 


Glu 


Phe 


Leu 






100 










105 


Met 


His 


Trp 


He 


Gly 


Leu 


Ser 


Arg 






115 










120 


Trp 


Thr 


Asn 


Gly 


Thr 


Thr 


Phe 


Asn 






130 










135 


Trp 


Ser 


Cys 


Asn 


Trp 


Ser 


Leu 


Arg 






145 










150 


Pro 


Leu 


Arg 













160 



<210> 18 

<211> 1147 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 402771CB1 

<400> 18 

gcgagctggg gccgcgcaat gtcgcacgga gccgggctcg tccgcaccac gtgcagcagc 60 

ggcagcgcgc tcggacccgg ggccggcgcg gcccagccca gcgcgagccc cttggagggg 120 

ctgctggacc tcagctaccc ccgcacccac gcggccctgc tgaaagtggc gcaaatggtc 180 

accctgctga ttgccttcat ctgtgtgcgg agctccctgt ggaccaacta cagcgcctac 240 

agctactttg aagtggtcac catttgcgac ttgataatga tcctcgcctt ttacctggtc 3 00 

cacctcttcc gcttctaccg cgtgctcacc tgtatcagct ggcccctgtc ggaacttctg 360 

cactatttaa tcggtaccct gctcctcctc atcgcctcca ttgtggcagc ttccaagagt 420 

tacaaccaga gcggactggt agccggagcg atctttggtt tcatggccac cttcctctgc 480 

atggcaagca tatggctgtc ctataagatc tcgtgtgtaa cccagtccac agatgcagcc 54 0 

gtctgatgag gccacaaccc ctaggcccct caggagcttt gcagagagga ggacgtgtac 600 

tccaggcgag gcctctggac ctgtgttcct gtgccaaagt cctgtcaggc tggtgggcac 660 

caggaaaggc ctgcaccctc ttcctgctct cccaggaagc cagctccctg agctcctgag 720 

ccagccggaa actcttcctc cagccttccg gggagaacat ccctcccatt ctgggaaagg 780 

aaagcagcct ccagggaaat gttttctgcc ttcctgcttc tagaaccacc tcaggtactg 840 

atgaacccca cttagcacag ctgaaggggt ttgtgaatac tcccgcctaa atcccttcta 900 

cttcactcct caggggagtg aagtgcctta agaaacaaag ccctgtccta atttatctag 960 

cttgtcagtc cggtcttaga gataccctct ttcctgaagt gaggcgtgcc tgtagaaaca 1020 

ctatgtggtc agcctgtccc caaggagatc ttgtgtctcc tctccatctc tgcctttgtt 1080 

accagtgtgc atgtgtttgt gtgtttttta ataaaatatt gactcggcca gttaaaaaaa 1140 

aaaaaaa 1147 

<210> 19 

<211> 1260 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 1296216CB1 

<400> 19 

gcttaggccc atccacaatc tcatcaagac tggatggctc ccgctggact atactggata 60 
tgtaggcaca gagcctacac tcagttacct gatcaatggg caggtagttg tgtcattggc 120 
accattaagc catccttttt cctactgccc ataaaaacag gtgacctcct aggtttccct 180 
gtctatgcct cccaagaaaa gaggagcata gctatagcaa attggaaaga tgatgagtgg 24 0 
tgccccgaaa ggatcataca gtactatagg cctgccacat gggcacaaga cgactcatag 3 00 
ggataccata cccccatcta catgctcaac tggatcatac ggttacaggc catcttagaa 3 60 
ataatcacta gtgaaactgg cagagcttta actcttttag cccagcagga aacccaaatg 420 
agaaatgcca tctatcagaa tagattggcc ttagactatt tgctggcagc tgaaggagga 480 
gtctgtgaaa aattcaactt gaccaagtgc tgtctgcaaa gagatgatca aggacaagta 54 0 
gtcaaaaata tagttagaga catgacaaag ctggcacatg tacccatgca ggtttggcat 600 
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aggt ttgatc 
cttataatag 
atatttctcc 
caagtatac t 
gataatgaga 
atttctccta 
agac tgatca 
ttgtgctatc 
ccacctagcc 
tagagctgta 
gcaagcctgc 



ctggatccct 
gaatgataat 
agttactaag 
acatgaatca 
actcccacta 
ctgtccccta 
gcaccactgg 
tatagatcac 
atgtacccca 
agccc ttaaa 
cgatgctcc t 



<210> 20 

<211> 2387 

<212> DNA 

<213> Homo sapiens 



gtt tggaaaa 
ggtgt tagga 
aagct tcgtt 
ctat tgatc t 
gtgagtgagg 
tc tccaaaga 
ccaggcccgt 
agacaatggt 
tgcttgt tea 
agggecaaga 
ggccgaataa 



tggcttccag 
acctgeatgt 
at cacct cag 
gtc tcacagg 
ttctcaaagg 
aaaggaggaa 
aggt taaaga 
atggagaaac 
atctatcacg 
ac tctttc tc 
agectet tec 



ctctaggagg 
tactcccctg 
ttcatcaaaa 
aagatctaga 
ggggaatgag 
gtaaaaactg 
t taacccaca 
acttgecttg 
accctttcat 
tgggactcag 
t tctttaaaa 



atttaaaact 660 

tatgctaccc 720 

gacctcagca 780 

tagtgaggat 84 0 

gagagaggee 9 00 

aaaaaataac 9 60 

ccctaaccgc 1020 

ctcaccaccc 1080 

gtggacccct 1140 

ttcttgagac 1200 

aaaaaaaaaa 12 6 0 



<220> 

<221> misc_feature 

<223> Incyte ID No.: 1693173CB1 

<400> 20 

gccatccttt ctccctttcc tcttgcctcc cacatcctcc cgccctcctc caccgctgcc 60 
cgcgtttcct cccgcccctc ccgaaggcag aegggcaggg tgtgagggcc cgtcctcccg 120 
gctcccctgg aggectgaca gcaggtgcaa gagcagaggc tgccaggccc tggcctctcc 180 
ccactgcgtc cgccggatgc tccccaggaa ggggacagee gcgtgcccag ggtggccaac 240 
ctcccactct gtccctcagg ggtctggaat aacaatccag aggacgactt caggatgccc 3 00 
aatggctcca ccattccccc agggagcect gaggagatgc ttttccactt tggaatgacc 3 60 
tggcagatca aegggacagg cctccttggc aagaggaatg accagctgcc ttccaacttc 420 
acccctgttt tctactcaca actgeaaaaa aacagctcct gggctgaaca tttgatctcc 480 
aactgtgacg gagatagctc atgeatctat gacaccctgg ccctgcgcaa cgcaagcatc 540 
ggacttcaca cgagggaagt cagtaaaaac tacgagcagg cgaacgccac cctcaatcag 600 
tacccgccct ccatcaatgg tggtcgtgtg attgaagect acaaggggca gaccacgctg 660 
attcagtaca ccagcaatgc tgaggatgee aacttcacgc tcagagacag ctgcaccgac 720 
ttggagctct ttgagaatgg gacgttgctg tggacaccca agtcgctgga gccattcact 780 
ctggagattc tagcaagaag tgecaagatt ggcttggcat ctgcactcca gcccaggact 840 
gtggtctgcc attgeaatge agagagecag tgtttgtaca atcagaccag cagggtgggc 900 
aactcctccc tggaggtggc tggctgcaag tgtgacgggg gcaccttcgg ccgctactgc 960 
gagggctccg aggatgectg tgaggagccg tgcttcccga gtgtccactg cgttcctggg 1020 
aagggctgcg aggcctgccc tccaaacctg actggggatg ggcggcactg tgcggctctg 1080 
gggagctctt tcctgtgtca gaaccagtcc tgccctgtga attactgeta caatcaaggc 1140 
cactgctaca tctcccagac tctgggctgt cagcccatgt gcacctgccc cccagccttc 1200 
actgacagcc gctgcttcct ggctgggaac aacttcagtc caactgtcaa cctagaactt 1260 
cccttaagag tcatccagct ettgetcagt gaagaggaaa atgcctccat ggcagaggtc 1320 
aacgcctcgg tggcatacag actggggacc ctggacatgc gggectttet ccgcaacagc 1380 
caagtggaac gaatcgattc tgcagcaccg gectegggaa gccccatcca acactggatg 1440 
gtcatctegg agttccagta ccgccctcgg ggcccggtca ttgacttcct gaacaaccag 1500 
ctgctggccg cggtggtgga ggcgttctta taccacgttc caeggaggag tgaggagece 1560 
aggaacgacg tggtcttcca gcccatttcc ggggaagacg tgcgcgatgt gacagccctg 162 0 
aacgtgagca egctgaagge ttacttcaga tgcgatggct acaagggcta cgacctggtc 1680 
tacagccccc agageggett cacctgcgtg tccccgtgca gtaggggcta ctgtgaccat 1740 
ggaggccagt gccagcacct gcccagtggg ccccgctgca gctgtgtgtc cttctccatc 1800 
tacacggcct ggggegagea ctgtgagcac ctgagcatga aactcgacgc gttcttegge 1860 
atcttctttg gggtcctggg cggcctcttg ctgctggggg tegggaegtt cgtggtcctg 1920 
cgcttctggg gttgetcegg ggccaggttc tcctatttcc tgaactcagc tgaggccttg 1980 
ccttgaaggg gcagctgtgg cctaggctac ctcaagactc acctcatcct taccgcacat 2040 
ttaaggegee attgettttg ggagactgga aaagggaagg tgactgaagg ctgtcaggat 2100 
tcttcaagga gaatgaatac tgggaatcaa gacaagacta taccttatcc ataggegcag 2160 
gtgcacaggg ggaggecata aagatcaaac atgcatggat gggtcctcac gcagacacac 222 0 
ccacagaagg acactagcct gtgcacgcgt gcgtgcgcac acacacacac gagttcataa 2280 
tgtggtgatg gecctaagtt aagcaaaatg cttctgcaca caaaactctc tggtttactt 2340 
caaattaact ctatttaaat aaagtctctc tgactttttg tgtcttc 2387 

<210> 21 
<211> 2172 
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<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 2095069CB1 

<400> 21 

cccgcccgct aatgttttgg ccgcttcaag atggcggtgc aggagtcggc ggctcagttg 60 

tccatgaccc tgaaggtcca ggagtacccg accctcaagg tgccctacga gacgctgaac 120 
aaacgctttc gcgccgctca gaagaacatt gaccgggaga ccagccacgt caccatggtg 180 

gtggccgagc tggagaagac gttgagcggc tgccccgccg tggactccgt ggtcagcctg 240 

ctggacggcg tggtggagaa gctcagcgtc ctcaagagga aggcggtgga atccatccag 300 

gccgaggacg agagcgccaa gctgtgcaag cgccggatcg agcacctcaa agagcatagc 3 60 

agcgaccagc ccgcggcggc cagcgtgtgg aagaggaagc gcatggatcg catgatggtg 420 

gagcacctgc tgcgttgcgg ctactacaac acggctgtca agctggcgcg ccagagcggc 480 

atcgaggacc tagtgaatat tgagatgttc ctgacggcca aagaggtgga ggagtccctg 540 

gagaggcgtg agacggccac ctgcctggcc tggtgccatg acaacaagtc ccggctccgg 600 

aagatgaaga gctgcctgga gttcagcctc agaatccagg agttcattga actcatccgg 660 

cagaataaga gactggacgc tgtgagacat gcaagaaagc acttcagcca agcagaaggg 72 0 

agccagctgg acgaggtgcg ccaggccatg ggcatgctgg ccttcccgcc cgacacgcac 780 

atctccccgt acaaggacct tctggaccct gcacggtggc ggatgctgat ccagcagttc 840 

cggtacgaca actaccgact acaccagctg ggaaacaatt ctgtgttcac cctcaccctg 900 

caggctggcc tctcagccat caagacacca cagtgctaca aggaggacgg cagctccaag 960 

agccctgact gccctgtgtg cagccgctcc ctgaacaagc tggcgcagcc cctgcccatg 1020 

gcccactgtg ccaactcccg cctggtctgc aagatttctg gcgacgtgat gaacgagaac 1080 

aatccgccca tgatgctgcc caacggctac gtctacggct acaattctct gctttctatc 1140 

cgtcaagatg ataaagtcgt gtgcccgaga accaaagaag tcttccactt ctcacaagcc 1200 

gagaaggtgt acatcatgta ggccccacgt cgtgaagcgc acgcctcggg gacgggctgc 1260 

agtgggcggg gaggccacgc cttcctcctg tcccacgctc cagcctgccg cggcgtttct 1320 

gtttcttgcg accaaagatc cgtgagcaac gataaatact cttaggaaga gagaaaataa 13 80 

ggtttcataa gtttgtactt gaaaacattt ggattggtag gattttgtaa cacgtcaacc 1440 

atttgatgct tctgaaaagt actttcaact tgcgaaggaa actcttcttt aaagactgac 1500 

ctaaacaccg agggaaactt aagaacgttt aaaatatagg agtccgtgat ttccctgtgt 1560 

tttcagtttc tttccttctg tgaacgatga gacttggaga acgggctggt ccttcaccac 1620 

ttcctgttgg ccctggcctg gccggggaag gtggcagcgg caccggactg acctgcagtg 1680 

acccgcgatg ccgcgccacg agggacactt atggcttcat tcgagagctg ctgccaaaac 1740 

gcctggcgcc gccaccgtcg ggggctggct tcgaggacgc ccgcctgcct cgcgggtcgt 1800 

gtccgcggga ctgtgttcgt acgtgcatag tttcgatatc acatcgcggg gctgtgttcg 1860 

tagctgcgtc gtttcgatat cacaccctct gtgtgccgcc ttacttcctg cttcgagaat 1920 

gtataacgtg gaaatccacg ggaccaaatt tctgcagagg ccttgccgga tggttccata 1980 

actgtagagt ctaattgcta tccattacag aaattaatcg ttcagttgaa agaagtactg 2040 

atgacttttc aaaacaaatg aaccaccgta gctgacagag aaccgtatcg tagaggtttg 2100 

tagttagtgc ttatttttgc atgttgatgt tgactagcta ataaactgta aatgtaaacc 2160 
aaaaaaaaaa aa 2172 

<210> 22 
<211> 2328 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 2645927CB1 

<400> 22 

cgcaggcaat gctcagccct ggatgtagct gagaggctgg gagaagagac gaccgctgga 60 

gaccgagcgg cgtggggaag acctaggggg gtgggtgggg gaagcagaca ggagaacact 120 

cgaaatcaag cgctttacag attattttat tttgtataga gaacacgtag cgactccgaa 180 

gatcagcccc aatgaacatg tcagtgttga ctttacaaga atatgaattc gaaaagcagt 24 0 

tcaacgagaa tgaagccatc caatggatgc aggaaaactg gaagaaatct ttcctgtttt 3 00 

ctgctctgta tgctgccttt atattcggtg gtcggcacct aatgaataaa cgagcaaagt 360 

ttgaactgag gaagccatta gtgctctggt ctctgaccct tgcagtcttc agtatattcg 420 

gtgctcttcg aactggtgct tatatggtgt acattttgat gaccaaaggc ctgaagcagt 480 

cagtttgtga ccagggtttt tacaatggac ctgtcagcaa attctgggct tatgcatttg 54 0 

tgctaagcaa agcacccgaa ctaggagata caatattcat tattctgagg aagcagaagc 600 

tgatcttcct gcactggtat caccacatca ctgtgctcct gtactcttgg tactcctaca 660 
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aagacatggt tgccggggga ggttggttca tgactatgaa ctatggcgtg cacgccgtga 720 
tgtactctta ctatgccttg cgggcggcag grttccgagt ctcccggaag tttgccatgc 780 
tcatcacctt gtcccagatc actcagatgc tgatgggctg tgtggttaac tacctggtct 840 
tctgctggat gcagcatgac cagtgtcact ctcactttca gaacatcttc tggtcctcac 900 
tcatgtacct cagctacctt gtgctcttct gccatctctt ctttgaggcc tacatcggca 960 
aaatgaggaa aacaacgaaa gctgaatagt gtirggaactg aggaggaagc catagctcag 1020 
ggtcatcaag aaaaataata gacaaaagaa aatggcacaa ggaatcacac gtggtgcagc 1080 
taaaacaaaa caaaacatga gcaaacacaa aacccaaggc agcttaggga taattaggtt 114 0 
gatttaaccc agtaagttta tgatcctttt agggtgagga ctcactgagt gcacctccat 1200 
ctccaagcac tgctgctgga agaccccatt ccctctttat ctatcaactc taggacaagg 1260 
gagaacaaaa gcaagccaga agcagaggag actaatcaaa ggcaaacaaa ggctattaac 1320 
acataggaaa aaatgtattt actaagtgtc acatttctct aagatgaaag atttttactc 1380 
tagaaactgt gcgagcacaa cacacacaat cctttctaac tttatggaca ctaaactgga 1440 
gccaatagaa aagacaaaaa tgaaagagac acagggtgta tatctagaac gataatgctt 1500 
ttgcagaaac taaagccttt ttaagaaatg ccagctgctg tagaccccat gagaaaagat 1560 
gtcttaatca tccttatgaa aacagatgta aacaactata tttcaactaa cttcatcttc 1620 
actgcatagc ctcaggctag tgagtttgcc aaaaccaaag ggggtgaata cttccccaag 1680 
attcttcctg ggaggatgga aacagtgcag cccaggtccc atgggggcag ctccatccca 1740 
gagcatttct gatagttgaa ctgtaatttc tactcttaag tgagatatga agcattatcc 1800 
ttttgttcag ttgccccggg cttttgaaca gaagagtaaa tacagaattg aaaaagataa 1860 
acactcaacc aaacaatgtg aaaacgggtt ctgtagtatt tgtaaaaagg cccggcccag 1920 
gaccactgtg agctggaaaa gggagaaagg cagtgggaaa agaggtgagc cgaagatcaa 1980 
ttcgacagac agatggtgtg tatgcccctc cctgtttgac ttcacacaca ctcataactt 2040 
tccaaatgaa accccacagc atagcgcata ttttcgatat ttttgtgaat tccaaaagga 2100 
aatcacaggg ctgttcgaaa tattggggga acactgtgtt tctgcatcat ctgcatttgc 2160 
tccccaagca atgtagaggt gtttaaaggg ccctctgctg gctgagtggc aatactacaa 2220 
caaacttcaa ggcaagtttg gctgaaaaca gttgacaaca aagggccccc atacacttat 2280 
ccctcaaatt ttaagtgata tgaaatactt gtcatgtctt tggccaaa 2328 

<210> 23 

<211> 1361 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 2732365CB1 

<400> 23 

gggctgcgtg cacctgctgg ccggggaccc cgaggccgcg ctgcgggcat ttgaccaagc 60 
cgtgaccaag gacacctgca tggcggttgg cttcttccag cgaggagtgg ccaacttcca 120 
gctggcaagg ggaccccggc tgcccacagg ttccaggagg ctctgtctga cttctggctg 180 
gccctggagc agctgagggg ccacgctgcc atcgactaca cgcagctggg cctgcggttc 240 
aagctgcaag cctgggaggt gctacacaat gtggcgtcgg cacagtgcca gctggggctc 3 00 
tggacagagg cggccagcag cctaagggag gccatgtcca agtggccgga ggggtccctg 360 
aatggcctgg actcagccct ggaccaagtg cagagacggg gctcactgcc gccacggcag 420 
gtccccaggg gcgaggtctt ccggccccac cggtggcacc tgaagcactt ggagcccgtg 480 
gatttcctgg gcaaggccaa ggtggtggcc tctgccatcc ccgacgacca gggctggggc 540 
gtccgccctc agcagccaca gggaccagga gcgaaccatg atgccaggtc cctaatcatg 600 
gactccccaa gagctggcac ccaccagggc cccctcgatg cagagacaga ggtcggtgct 660 
gaccgctgca cgtcgactgc ctaccaggag cagaggcccc aggtggagca agttggcaaa 72 0 
caggctcctc tctccccagg gctgccggca atgggggggc ctggccccgg cccctgtgag 780 
gaccccgcgg gtgctggggg agtaggtgca gggggctccg agcccctggt gactgtcacc 840 
gtgcagtgcg ccttcacagt ggccctgagg gcacgaagag gagccgacct gtccagcctg 900 
cgggcactgc tgggccaagc cctccctcac caggcccagc ttgggcaact cagttaccta 960 
gccccaggtg aggacgggca ctgggtcccc atccccgagg aggagtcgct gcagagggcc 1020 
tggcaggacg cagctgcctg ccccaggggg ctgcagctgc agtgcagggg agccgggggt 1080 
cggccggtcc tctaccaggt ggtggcccag cacagctact ccgcccaggg gccagaggac 1140 
ctgggcttcc gacaggggga cacggtggac gtcctgtgtg aagtggacca ggcatggctg 12 00 
gagggccact gtgacggccg catcggcatc ttccccaagt gcttcgtggt ccccgccggc 1260 
cctcggatgt caggagcccc cggccgcctg ccccgatccc agcagggaga tcagccctaa 1320 
tgatgctgtg tccatgatgc ttttaataaa aacaaccccc a 13 61 

<210> 24 

<211> 789 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_f eature 

<223> Incyte ID No.: 3536577CB1 



<400> 24 

agtggattag ttttgggttt cacttaatct gggaggccct tccgcaatcg gagccctcac 60 
agaggccaaa ctgatataaa tctgcttagg aggcctgatt cacagacgct acaggatgga 120 
gcggggcgca ggagccaagc tgctgccgct gctgctgctt ctgcgggcga ctggtttcac 180 
atgtgcacag gcagatggcc ggaacggcta cacggcggtc atcgaagtga ccagcggggg 240 
tccctggggc gactgggcct ggcctgagat gtgtcccgat ggattcttcg ccagcgggtt 3 00 
ctcgctcaag gtggagcctc cccaaggcat tcctggcgac gacactgcac tgaatgggat 360 
caggctgcac tgcgcgcgcg ggaacgtcct aggcaatacg cacgtggtag agtcccagtc 420 
tggaagctgg ggcgaatgga gtgagccgct gtggtgtcgc ggcggcgcct acctagtggc 480 
tttctcgctt cgcgtggagg cacccacgac cctcggtgac aacacagcag cgaacaacgt 540 
gcgcttccgc tgttcagacg gcgaggaact gcaggggcct gggctgagct ggggagactt 600 
tggagactgg agtgaccatt gccccaaggg cgcgtgcggc ctgcagacca agatccaggg 660 
acctagaggc ctcggcgatg acactgcgct gaacgacgcg cgcttattct gctgccgcag 720 
ttgaacggcg ccgccgccgc cgctctctcc cgggccagga ggctagtccc acctcttgct 780 
attaaagct 789 



<210> 25 

<211> 1793 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No.: 5587790CB1 



<400> 25 

cagaccccga ccccgacccg gaccccgagc 
tccccgggct ccgcgccctg ctgccggcgc 
tcctgctggc cgagtcgggt ctctcattca 
catctgcctt cctcacagcg cctctgctgg 
ctgatgccat gcagctgaat gacaagtggc 
tgcgctgtgt caccgcggcc ctcatctact 
actcggatgg ggcttccaaa gccgctgggg 
caactgattt ctacctgatc tttaacgacg 
cagatgagac cacagcccac aagacagaag 
ggcctggcgg gtgccttggc aacctgagcc 
ggtcgctggc gttggagcgg aggcctggac 
gcaggccccc tacagcctca ggttctgcct 
tcagcactgt tgaccacgct gcgtatgagg 
tttctgtccc acaggccttc agccctttaa 
aaagcagagc cttgtctgta tctgggcagc 
cgggggtctt ctgtttcact aacaggaaca 
caggccccgg tgggtttgtc tgcacttggt 
aatgaccctt ccaagaatct ttggttcaag 
agggagaaat tgacctttgc cttgtcgccc 
acctcccacc cttggaaacc atgtcttctg 
tgtttcaaag aagagctcat agactgactg 
tcacagtgac agtgtcatgg ggagctgggc 
gcccagccag caggcaggag ttcctggacc 
caggtctatg ggccactgca ggagatgaga 
actgtgtatc tgtctttgat ggaattttgt 
cttcttaaca gatggcattt tctgtgactc 
cccatgtttg gagcatttgt agccttattc 
aagccataat gaattaagcc agactgacca 
gagcaatact gttgggtatt gtatcaggct 
cctgacagct ttcagtttta gcaaaaatac 

<210> 26 

<211> 3694 

<212> DNA 

<2 1 3 > Homo s ap i ens 



ctgccggcgg ctcccgtccc ggccccgcgg 60 
gggctttcct ctgctctctc aaaggccgcc 120 
tcacrtttat ctgctatgtg gcgtcctcag 180 
agttcctgct ggccttgtac ttcctctttg 240 
agggcttgtg ctggcccatg atggacttcc 3 00 
ttgctatctc catcacggcc atcgccaagt 360 
tgtttggctt ctttgctacc atcgtgtttg 420 
tggccaaatt cctcaaacaa ggggactctg 480 
aagagaattc cgactcggac tctgactgaa 540 
acacaggcct ccacccctgt gcctcacagg 600 
ttctgagttg cagagggggc tgcggacaca 6 60 
gagcccagcc taccaggctt gcccctcagc 720 
gcatcttggg tatcccactc cttctcccca 780 
cgtctctgcc aaaaaccagc acaaggagac 840 
aggtgttcca tgctgctagg tggcgggggt 900 
aagacagaaa ccatgacagg gctgccccgc 9 60 
gctcctgccc acaccagcca ctttggtgac 1020 
gagcaccagt tccctcttca ttcttgaagc 1080 
aggaagtggg gctcggcacc cataactaac 1140 
ggggtgagat gaccattctg ggtctaagac 1200 
gtccagaaga cagagggtac aacagtggca 12 60 
gggcccagcc aaaccctcct tcttcctaga 1320 
ctcaggacag tgaacttcca gacctcaggg 1380 
ccagccttct gtgttcacct aacgatttat 1440 
aactttttat atttttttat gcaaaagcag 1500 
taggcctcac aaaagagcca gagttctgga 1560 
tcttgcgtgt gaatctctta ccctgaaaaa 1620 
cttgcttgga gtgtgtgctt gaaaaaacca 1680 
tcagtacaaa ctggtaacac caatgtggat 1740 
acgtgaaatc tgaaaaaaaa aaa 17 93 
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<220> 

<221> mi sc_ feature 

<223> Incyte ID No.: 5733930CB1 

<400> 26 

ctggcaggtt cctagtgagc ctcctgggca ggggtctgga gggtcggaga aggagccccg 60 
cggctcccac gcccccacct gcctccagtc ttgaagtggc tctgccgcca gaagctggca 120 
gccctacctg tctcagattt ctggagggat gggtaggagc cagggttcgt gcccacggcg 180 
cctccctgct ccctgcagct gagtcccaac gggggccagg aggacacgcg gatgaagaac 240 
gtgccggtgc cggtgtactg ccgccctc tg gtggagaagg accccaccat gaagctgtgg 3 00 
tgtgccgcgg gcgtcaacct gagcgggtgg aggcccaatg aggacgacgc tgggaatgga 360 
gtcaagccag cgccaggccg cgatcccctg acctgcgacc gcgaaggaga cggcgagccc 42 0 
aagagcgccc acacgtctcc cgagaagaag aaggccaagg agctccctga aatggacgcc 480 
acctccagcc gggtgtggat cctgaccagc accctgacca ccagcaaggt ggtgatcatc 540 
gacgccaacc agccgggcac ggtggtggac cagttcaccg tctgcaacgc gcacgtgctg 600 
tgcatctcca gcatccccgc ggccagcgac agcgactacc ctcccgggga gatgttcctg 660 
gacagcgacg tgaacccaga ggacccgggc gcagatggcg tgctggccgg tatcaccctg 72 0 
gtgggctgtg ccacccgctg caacgtgccg cggagcaact gctcctcccg aggggacacc 780 
ccagtgctag acaaggggca gggggaggtg gccaccatcg ccaacgggaa ggtcaacccg 84 0 
tcccagtcca cagaggaggc cacagaggcc acggaggtgc cagaccctgg gcccagcgag 900 
ccagagacag ccacattgcg gcccgggcct ctcacagagc acgtcttcac tgacccagcc 960 
ccgaccccgt cctctggccc ccagcctggc agcgagaacg ggccagagcc tgacagcagc 1020 
agcacacggc cagagccaga gcccagcggg gaccccacgg gagcaggcag cagtgctgca 1080 
cccaccatgt ggctgggagc ccagaacggc tggctctatg tgcactcggc tgtggccaac 1140 
tggaagaagt gcctgcactc catcaagctg aaggattctg tgctgagcct ggtgcatgtc 1200 
aaaggccgtg tgctggtggc tctggcggac gggaccctgg ccatcttcca ccgtggtgaa 12 60 
gatggccagt gggatctgag caactatcac ctaatggacc tgggccaccc gcaccactcc 1320 
atccgctgca tggctgttgt gtacgaccgc gtgtggtgtg gctacaagaa caaggtgcac 1380 
gtcatccagc ccaagaccat gcagatagag aagtcatttg acgcccaccc gcggcgggag 1440 
agccaggtgc ggcagctggc gtggatcggc gatggcgtat gggtgtccat ccgcctggac 1500 
tccaccctga ggctctacca tgcacacacg caccagcatc tacaggacgt ggacattgag 1560 
ccctacgtca gcaagatgct aggcactggc aagctgggtt tctccttcgt acgcatcacg 1620 
gccctgcttg tcgcgggcag ccggctctgg gtgggcaccg gcaacggagt ggtcatctcc 1680 
atccccctga cagagactgt ggtcctgcac cgaggccagc tcctggggct ccgagccaat 174 0 
aagacatccc ccacctctgg ggagggcgcc cgtcccgggg gcatcatcca cgtgtatggc 1800 
gatgacagca gtgacagggc ggccagcagc ttcatcccct actgctccat ggcccaggcc 1860 
cagctatgct tccatgggca ccgcgatgcc gtgaagttct ttgtctcggt gccagggaac 1920 
gtgctggcca ccctgaatgg cagtgtgctg gacagcccag ccgagggccc tgggccagct 1980 
gcccctgcct cggaggtcga gggccagaag ctgcggaacg tgctggtgct gagcggcggg 2040 
gagggctaca tcgacttccg cattggagac ggagaggacg acgagacgga ggagggcgca 2100 
ggggacatga gccaggtgaa gcccgtgctg tccaaggcag agcgcagtca catcatcgtg 2160 
tggcaggtgt cctacacccc cgagtgaagc tgctgccctg cctggcccga cctgtacata 2220 
ggacccccga ccacctgacc cccgcccggc ccgcggggta gccagccagg cgccgccgcc 2280 
cctcttctaa cctctcaacc tgcagctttc acctgagtct ggcccctcca gcgggcaggg 2340 
agtgcgggga tgcggatcag ctgggaggag gaggggaggg gtgcttccac ccgaggggaa 2400 
gatgctctcg ggacagtttc ccgggcagct cctggccagc ttccagccca gagtcctcaa 2460 
gtccagggca ccttgggccc agcgcaggca gaatccgagg tggtcctggc tctaccctgg 2520 
gcctcctact ccccagcacc cctggaggag gcaggggctc cccgccgccg aggctgcctg 2580 
ccctgggccc acctctgcat gctgctcatg gggccaccct gcctcctggg ccctcactct 2640 
gcctagggga gctgggccag gcactagcct ttgcccaggg aggtgggcct caggctgccc 2700 
aggtgcctgc accccagccg gccttctctg gggcctcccc gtcgtcaagc ctctatcctg 2760 
tctgtcccca ccccagctgt cccctgccca gggagctggc ataaaagcac gaggcccggc 2820 
tccctggggc agctgcttga gaacagagac tgctacccca tcctgcccat gcaggcaggc 2880 
tcttgccagc cccgttctga cccgtgtccc cccaggctct gcctgggcag aagactcacc 2940 
ttggaggagt gggccctgga gtcctgtccc tcccagaagc ccccagggtg ggatttctca 3 000 
ggctgccagg gcaggcccag gcctcaggaa gaaggggagg cccc tggcct ctccgggatc 3 060 
agtcctagga cacaggctca gcctcaggtt gatgggggat gatgtgctcc cggggcctgc 312 0 
ctcctgcacg gggctccacg gagcccagct cccagacacg ctactaagtg cctagggttg 3180 
cccgctgtgg cctgctccca gggagcaaca gagaggccac caagcagagg cccgtggggc 3240 
tgaggatgga gccgccccca gccgactcca agcccgcaga gggcagacgc caccctggac 3300 
tgctctccct gcccagctgg gcctctctgg cctattccta ccttccaggc ccactgcact 3360 
cctgtctggg aggcccttat gagggcagcc cagcccccgc acccaccccc aaccagagaa 3420 
gcacagatct tggggagctg ccccacaagc cccgctggcc accgagggct gcagccgctg 3480 
cgctgccggc ttctccccac caccctgcca cctccactgt gatgtatgtc cgctccctcg 3540 
tctgttcccc caggatctcg aagtgactcc gggctgagca gtggggcggc tgggggaggg 3 600 
gtgacgattc tcctcaggct ttggccctgc aagcaaaccc acatatctgc tctgtatgta 3660 
ataaatgtct taacgtcgta aaaaaaaaaa aaaa 3 694 
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<210> 27 
<211> 2000 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc^feature 

<223> Incyte ID No.: 645566CB1 



<400> 27 

cgacaggtca gagctgcggc ctgagcagcc 
tgctgcctgc ttcttgctcc agcaccatgg 
cccgcgcgat gagacttccc cggcggacgc 
tgggtcctcc cgttgctgcc tgcggccgcc 
cgcagctttg cgggcccgac cggctccgcg 
ctgacgtctc tcaagccact ttagccagtg 
acaaacaggg aaacgttact tcttttgaaa 
gtcttcaagc cagagatttg agatttcagc 
ggattatcat gagaatggag tatttgaaag 
tagattatcg taatttaaac ttagagcaat 
ctggagaggg tcaactcgtt acataccctt 
tcctgcaata ttggatcaac acccttcagg 
ttgagacctt ggatgctttg gtggacccca 
acattttact acagaatggc aaaagtctat 
aagagtcaat tttggagatc ttggatgagg 
aatggagtga cccacaagtc tttgaaaaga 
tggagttgct gttggaaaac tactaccgat 
agcttagggt gctgattgat gattcacaaa 
gaaacgtgat gatgaggttg aatctacagc 
ttggactaat gggagttgct tttggaatga 
gaattttttg gctgattaca ggaattatgt 
tgctttcatt ccttggacga cagctagaag 
ctaaaaagac tcttctggca gatagaagca 
gacttggatc aggaaggagc atcctaacaa 
gtttttttta tggtagttac aggaaacttc 
agtcagacac ttgaaaaaaa ctaatgtttg 
cagaactgga ttgcatttcc agaattctga 
aggccaaaat tctatttcct acaaacttta 
aacacaagca cagacagttg catagatttt 
catgctgaat gaagcgtagg aacttgacaa 
atagggaatt atgttcatgt gaatttcctg 
atgtattaac cttacaacag actctgtaag 
accttatttc ttgtatttgc cccctttttt 
aatatcaaaa aaaaaaaaaa 



agcgtccggc atgaaggtct ggggtctggc 60 
aatgcctgcg cagtttaccc tgcctcctgc 120 
tgtgtgccct ggccttggac gtgacctctg 180 
gagccaacct gattggaagg agccgagcgg 240 
tggcaggtga agtgcaccgg tttagaacct 300 
tagccccagt atttactgtg acaaaatttg 360 
ggaagaaaac tgaattatac caagagttag 420 
atgtaatgag tatcacagtc agaaacaata 480 
ctgtgataac tccagagtgt cttctgatat 540 
ggctgttccg ggaactccct tcacagttgt 600 
taccttttga gtttagagct atagaagcac 660 
ggaaacttag cattttgcag ccactgatcc 720 
aacattcttc tgtagacaga agcaaactgc 780 
cagagttaga aacagatatt aaaattttca 840 
aagagttgct agaagagctc tgtgtatcaa 9 00 
gcagtgctgg gattgaccat gcagaagaaa 960 
tggctgacga tctctccaat gcagctcgtg 1020 
gtattatttt cattaatctg gacagccacc 1080 
tgaccatggg aaccttctct ctttcgctct 1140 
atttggaatc ttcccttgaa gaggaccata 1200 
tcatgggaag tggcctcatc tggaggcgcc 12 60 
ctccattgcc tcctatgatg gcttctttac 1320 
tggaattgaa aaatagcctc agactggatg 13 80 
accgttagga acagccccgt ggatactgaa 1440 
tgatactctt tttattattt tcttgtatag 1500 
aagacaaaaa tattttggca gtcacaatac 1560 
gttaaagaaa caaagtattt gctttgtaaa 1620 
aatgctgttt ttatagatgt gatatgaggc 1680 
aatttataca tatcaagaaa agtgcaattt 1740 
gcccataggt agctatagtt ctttgtcagt 1800 
attctcaggt gactaaaaag ctagcattct 1860 
tttgagcttt aaaaaccaaa ctttgacata 1920 
ttataaaagg tgaataaaaa gaaataattt 1980 

2000 



<210> 28 
<211> 2973 
<212> DNA 

<2 13 > Homo sapi ens 



<220> 

<221> misc_f eature 

<223> Incyte ID No.: 1503111CB1 



<400> 28 

cggctcgagg gaaacaggaa aataaaaatg ttcaaactcc ttggatgttg ggataaactc 60 
acctgaaccc acttgggttc gggctgcctc cttctcttcc ttcattgcca cctttcctct 120 
gtgtggctcc cgggagtgtg cggttaagtc atcagactcg aagtgqctag agatccggag 180 
gaagccgcgc cggtcttccc ctgacatgcg tggcatgccg gggctccgta ggaggtttgc 240 
tatacctggg aggaccctgg cattctaaat ttcagctccg ggaaagagaa ggggcttttt 300 
gccttttatc tttttttttt ctttctttaa gtagtaattt tttaactgat tcattgtttg 360 
gaaagcgcat attgcttccc tcttccccga attctggcaa ctcttcctcc tgctatgatg 420 
ggcccttggg catcatgaac ttcattactc ctcactggct ggaattcaaa ctgcccatct 480 
gtagtggtcc cgtgcgttga ccatgcacct gagaatccac gcgagacgga gccctcctcg 540 
ccggccggcc tggacgcttg ggatctggtt cctgttctgg ggatgtatcg tcagctctgt 600 
atggagttct tctaatgtag cttcctcctc ctccacctct tcctcgccgg ggtctcactc 660 
tcagcacgag caccatttcc atggcagcaa gcatcactca gtgcctattt ctatctatcg 720 
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ttcccctgtt tcccttcgag gaggacacgc tggcgctacg tacatctttg ggaaaagtgg 780 
tgggcttatc ctctacacct ggccagccaa tgacaggccc agcacgcggc ctgaccgcct 840 
tgccgtgggc ttcagcacca ctgtgaagga tggcatcttg gtccgcatcg acagtgctcc 900 
aggacttggt gacttcctcc agcttcacat agaacagggg aaaattggag ttgtcttcaa 960 
cattggcaca gttgacatct ccatcaaaga ggagagaacc cctgtaaatg acggcaaata 1020 
ccatgtggta cgcttcacca ggaacggcgg caacgccacc ctgcaggtgg acaactggcc 1080 
agtgaatgaa cattatccta caggcaacac tgataatgaa cgcttccaaa tggtaaaaca 1140 
gaaaatcccc ttcaaatata atcggcctgt agaggagtgg ctgcaggaaa aaggccggca 1200 
gttaaccatc ttcaacactc aggcgcaaat agccattggt ggaaaggaca aaggacgcct 1260 
cttccaaggc caactctctg ggctctatta tgatggtttg aaagtactga acatggcggc 1320 
tgagaacaac cccaatatta aaatcaatgg aagtgttcgg ctggttggag aagtcccatc 1380 
aattttggga acaacacaga cgacctccat gccaccagaa atgtctacta ctgtcatgga 1440 
aaccactact acaatggcga ctaccacaac ccgtaagaat cgctctacag ccagcattca 1500 
gccaacatca gatgatcttg tttcatctgc tgaatgttca agtgatgatg aagactttgt 1560 
tgaatgtgag ccgagtacag gaggtgaatt agttatccct cttcttgtag aagacccttt 1620 
agctacccct cctattgcta ctcgtgcacc ttccattaca ctccccccta cctttcgccc 1680 
cctcctcacc attattgaga ccaccaaaga ttccctgtcc atgacctctg aggcggggtt 1740 
accttgcttg tcggaccaag gcagcgatgg ttgtgatgat gatggcttgg tgatatctgg 1800 
gtatggctca ggggaaacct ttgactctaa cctgccccct actgatgatg aagattttta 1860 
caccaccttc tccttggtaa cagataagag tctttccact tcaatcttcg aaggtggcta 1920 
caaagcacat gcgcccaagt gggaatccaa ggactttaga cctaacaaag tctccgaaac 1980 
tagtaggact actaccacat ctttatcccc tgagctgatc cgcttcacag cttcctcctc 2040 
gtctgggatg gtgcccaaat tgccagctgg caaaatgaat aaccgtgatc tcaaacccca 2100 
gcctgatata gtcttgcttc cgttgcccac tgcctatgag ctagacagca ccaaactgaa 2160 
gagcccacta attacttccc ccatgttccg taatgtgccc acagcaaacc ccacggagcc 2220 
gggaatcaga cgggttccgg gggcctcaga ggtgatccgg gagtcgagca gcacaacagg 2280 
gatggtcgtc ggcattgtgg ctgctgccgc cctctgcatc ttgatcctcc tgtacgccat 2340 
gtacaagtac aggaacaggg acgaggggtc ctatcaagtg gacgagacgc ggaactacat 2400 
cagcaactcc gcccagagca acggcacgct catgaaggag aagcagcaga gctcgaagag 2460 
cggccacaag aaacagaaaa acaaggacag ggagtattac gtgtaaacat gcgaacactg 2520 
ctcacacgcg agttttcaca gttatttcta tccacgccta tgaatctttg gacggtgaga 2580 
tctcacagat gtcagaactg ctggaactat gaaatggggt atataaccac gactctggtg 2640 
gggaaaaccg ttttttaaag gacacacaca cacacagcga tgcatctctc tctaaagctc 2700 
agccacggct gcggcaaggt cccagcggtc gctgggagac agaaggtttt gtgccctgct 27 60 
gtatcataaa gcacacactt agcgctctgg agccggacgg tggctccacc acttccgcag 2820 
gcctggaaac ttccttctcc ggaggacctt ttactaaaag gtagaagact tcatggctta 2880 
cttgttccat aactccaagt gagtctgtaa tgtttgtgaa gcttgactgt aaccatgttt 2940 
tttctgttaa gatatgtaaa aagcgaacgt ate 2973 

<210> 29 
<211> 2394 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 1984287CB1 

<400> 29 

aggaaaatgg eggtegctgg agccgccgac caagaggctt gggagtctgt acctttcccg 60 
accgggccac tggaagttgg agcctccgcc gagtegcaga caacgcctcc gggagggect 12 0 
tectgatgeg cttgcctgct ccctggtctc tctgcatggg gaaggagtgt tcccagcttg 180 
caaactccag etttgectgt gagaggaaca agcgtccctg atccagaagg tgttcggatg 240 
gagatggega gttctgctgg ctcctggctc tctggctgcc tcatccctct tgtcttcctc 3 00 
cggctgtctg tgtatgtgtc aggccacgca ggggatgccg gcaagttcca cgtggcccta 3 60 
ctagggggca cagccgagct gctctgccct ctctccctct ggcccgggac ggtacccaag 420 
gaggtgaggt ggctgcggtc cccattcccg cagcgctccc aggctgttca catattcegg 480 
gatgggaagg accaggatga agatctgatg ceggaatata aggggaggac ggtgctagtg 54 0 
agagatgece aagagggaag tgtcactctg cagatccttg acgtgcgcct tgaggaccaa 600 
gggtcttacc gatgtctgat ccaagttgga aatctgagta aagaggacac cgtgatcctg 660 
caggttgcag ccccatctgt ggggagtctc tccccctcag cagtggctct ggctgtgatc 720 
ctgcctgtcc tggtacttct catcatggtg tgcctttgcc ttatctggaa gcaaagaaga 780 
gcaaaagaaa agcttctcta tgaacatgtg acggaggtgg acaatcttct ttcagaccat 84 0 
gctaaagaaa aaggaaaact ccataaagct gtcaagaaac tccggagtga actgaagttg 9 00 
aaaagagctg cagcaaactc aggctggaga agageceggt tgcattttgt ggcagtgacc 96 0 
ctggacccag acacagcaca tcccaaactc atcctttctg aggaccaaag atgtgtaagg 1020 
cttggagaca gacggcagcc tgtacctgac aacccccaga gatttgattt cgttgtcagc 1080 
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atcctaggct ctgagtactt cacgactggc tgccactact gggaggtgta tgtgggagac 1140 
aagaccaaat ggattcttgg agtatgtagt gagtcagtga gcaggaaggg gaaggttact 1200 
gcctcacctg ccaatggaca ctggcttctg cgacagagtc gtgggaatga gtatgaagct 1260 
ctcacatccc cgcagacctc cttccgcctt aaagagcctc cacggtgtgt ggggattttc 1320 
ctggactatg aagcaggagt catctctttc tacaatgtga ccaacaagtc ccacatcttt 1380 
actttcaccc acaatttctc tggccccctt cgccctttct ttgaaccttg ccttcatgat 1440 
ggaggaaaaa acacagcacc tctagtcatt tgttcagaac tacacaaatc agaggaatca 1500 
attgtcccca ggccagaagg gaaaggccat gctaatggag atgtgtccct caaggtgaac 1560 
tcttctttac tacccccgaa ggccccagag ctgaaggata taatcctgtc cttgccccct 1620 
gaccttggcc cagcccttca ggagctcaag gctccttctt tttagggata tgccacatta 1680 
cctgctccca tcaccatcca gcccagcacc ctggacttca gtcgcctggc ccaaccccat 1740 
gattatggaa cgtctcttca ccttaaccca aatccagacc cttttgtggt ttctatttgt 1800 
accacttttc tcccaggcct cagttctgaa gcttaccttt cttctaagga attgaagctc 1860 
ccagtgacct ggagggagga ttcctggaaa ccaaacaatc agtttaggtg caggtggaga 192 0 
tgttgaatat gtgttaccaa gatacagcac aggttcaggg aaaagagttc gctactccag 1980 
gggttattta gaagacactt tctctgcctc atcctgccct caagctttag tcaagaagtt 2040 
atggccccca gtccctgact tcttacttat cccattgagg actgcctttc tctctctcag 2100 
ttctggcctc tgccccccaa agtcagctct ctaaaagcaa gcatgttttt agaccactca 2160 
ctctttccct ctttcttcag gaatgaattg ggaaaggctg atgagtaaaa cataccatcc 2220 
ttttctattt tcttgatgct gtttacaaca tagtttggtg atatccagag ctaatgtaca 2280 
tgctttcaaa agctaatctg cctgttgatg ataactaggt acagcgactt taaatacagt 234 0 
tgctataatc ctgaaaagcc ccaggagcac atcagggagc tggagaacac agtt 23 94 

<210> 30 

<211> 1853 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No.: 2055289CB1 

<400> 30 

gctgccagcc cggccaggca cccctgcagc atggcctgga acaccaacct ccgctggcgg 60 
ctgccgctca cctgcctgct cctgcaggtg attatggtga ttctcttcgg ggtgttcgtg 120 
cgctacgact tcgaggccga cgcccactgg tggtcagaga ggacgcacaa gaacttgagc 180 
gacatggaga acgaattcta ctatcgctac ccaagcttcc aggacgtgca cgtgatggtc 24 0 
ttcgtgggct tcggcttcct catgactttc ctgcagcgct acggcttcag cgccgtgggc 3 00 
ttcaacttcc tgttggcagc cttcggcatc cagtgggcgc tgctcatgca gggctggttc 3 60 
cacttcttac aagaccgcta catcgtcgtg ggcgtggaga acctcatcaa cgctgacttc 42 0 
tgcgtggcct ctgtctgcgt ggcctttggg gcagttctgg gtaaagtcag ccccattcag 480 
ctgctcatca tgactttctt ccaagtgacc ctcttcgctg tgaatgagtt cattctcctt 540 
aacctgctaa aggtgaagga tgcaggaggc tccatgacca tccacacatt tggcgcctac 600 
tttgggctca cagtgacccg gatcctctac cgacgcaacc tagagcagag caaggagaga 660 
cagaattctg tgtaccagtc ggacctcttt gccatgattg gcaccctctt cctgtggatg 720 
tactggccca gcttcaactc agccatatcc taccatgggg acagccagca ccgagccgcc 780 
atcaacacct actgctcctt ggcagcctgc gtgcttacct cggtggcaat atccagtgcc 840 
ctgcacaaga agggcaagct ggacatggtg cacatccaga atgccacgct cgcaggaggg 9 00 
gtggccgtgg gtaccgctgc tgagatgatg ctcatgcctt acggtgccct catcatcggc 960 
ttcgtctgcg gcatcatctc caccctgggt tttgtatacc tgaccccatt cctggagtcc 1020 
cggctgcaca tccaggacac atgtggcatt aacaatctgc atggcattcc tggcatcata 1080 
ggcggcatcg tgggtgctgt gacagcggcc tccgccagcc ttgaagtcta tggaaaagaa 1140 
gggcttgtcc attcctttga ctttcaaggt ttcaacgggg actggaccgc aagaacacag 1200 
ggaaagttcc agatttatgg tctcttggtg accctggcca tggccctgat gggtggcatc 1260 
attgtggggc tcattttgag attaccattc tggggacaac cttcagatga gaactgcttt 1320 
gaggatgcgg tctactggga gatgcctgaa gggaacagca ctgtctacat ccctgaggac 1380 
cccaccttca agccctcagg accctcagta ccctcagtac ccatggtgtc cccactaccc 1440 
atggcttcct cggtaccctt ggtaccctag gctcccaggg caggtgagga gcaggctcca 1500 
cagactgtcc tggggcccag aggagctggt gctgacctag ctagggacgc aagagtgagc 1560 
aagcagcacc cccacctgct ggcttggcct caaggtgcct ccacccctgc cctccccttc 1620 
atcccagggg gtctgcctga gaatggagaa ggagaagcta caaagtgggc atccaagccg 1680 
ggttctggct gcagaagttc tgcctctgcc tggggtcttg gccacattgg agaaaaacag 1740 
gctcaaagtg gggctgggac ctggtgggtg aacctgagct ctcccaggag acaacttagc 1800 
tgccagtcac cacctatgag gctcttctac cccgtgcctg cacctcggcc age 1853 

<210> 31 
<211> 3617 
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<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 2279216CB1 

<400> 31 

gaccgtgcta gccaggaggc gggagcgatc cacagcagct gacccagctc aggcagtgct 60 
ctctctgagt cctcaagata caccatgggc ccagaggcag gtttgctaca cagcagcgac 120 
gacgcaggcg gcggccccag cgactcgcaa ctgcctccct gaccacagcg gccaccgccc 180 
aacacccccg agaagccatc gccaccaccg gcaggagaac ctagggtcca taaagccatc 240 
ttcgcgatcg actaaagcta cgtcaacaac tatggcgggc gacgggcggc gggcagaggc 3 00 
ggtgcgggaa ggatggggtg tgtacgtcac ccccagggcc cccatccgag agggaagggg 3 60 
ccggctcgcc cctcaaaatg gcggctgcag cgatgcgcct gcgtacagaa ctcctccgtc 420 
gcaccagggc cggcgggaag tgaggttctc ggacgagccg ccagaagtgt acggcgactt 480 
cgagcccctg gtggccaaag aaaggtcccc ggtgggaaaa cgaacccggc tagaagagtt 540 
ccggtccgat tctgcgaaag aggaagtgag agaaagcgcg tactaccttc ggtctaggca 600 
gcggaggcag ccgcgacccc aggaaaccga ggaaatgaag acgcgaagga ctacccgcct 660 
tcagcagcag cactcagagc agcctccgct acagccgtct cctgttatga ccaggagagg 720 
gctgcgggac tctcattcct ctgaagagga tgaagcatct tcccaaactg atttaagcca 780 
aacgatctca aagaaaactg tcaggagcat acaagaggct ccagcagtga gtgaagatct 84 0 
tgtaa'tcagg ttacgtcgac cccctctaag atacccaaga tatgaagcca ccagtgtcca 900 
acagaaggtc aatttctctg aagaaggaga aactgaagaa gatgatcaag acagctctca 9 60 
cagcagtgtc actactgtta aggccagatc cagggattct gatgaatctg gagataaaac 1020 
caccagatca tctagtcaat atatagaatc attttggcag tcatcacaaa gtcaaaactt 1080 
cacagctcat gataagcaac cttcagtgct aagctcagga tatcaaaaaa ctccccagga 1140 
atgggcccca caaactgcaa gaataaggac caggatgcaa acatcttcac caggcaagag 12 00 
ttcaatatat ggcagttttt cagatgatga cagcattctg aaatcagagc ttggaaacca 1260 
gtcaccatca acctccagcc gacaagtgac tggacaaccc caaaatgcat cttttgtcaa 1320 
gaggaaccgg tggtggctac ttcctctgat agctgctctt gcctctggga gtttttggtt 1380 
ctttagtact cctgaggtag aaaccactgc tgttcaagag ttccagaacc agatgaatca 144 0 
acttaagaat aagtaccaag gtcaagatga gaagctgtgg aaaaggagcc aaacattcct 15 GO 
ggaaaaacat cttaatagct cccatcctcg gtctcagcct gctatcttac tgctcactgc 1560 
tgcccgagat gctgaagaag cacttaggtg tctgagtgaa caaattgctg atgcctattc 162 0 
ttcttttcgt agtgtccgtg ccatccggat tgatgggaca gataaagcta ctcaagacag 1680 
tgatactgtc aaactagagg tagaccaaga actgagcaat ggatttaaga atggccagaa 1740 
tgcagctgtg gtacaccgct ttgagtcatt tcccgcaggc tctactttga tcttctacaa 1800 
atattgtgac catgaaaacg cagccttcaa agatgtagcc ttagtcctga ctgtcttatt 1860 
ggaggaagag acacttggaa caagtctagg cctaaaggaa gttgaagaaa aagtaagaga 1920 
ttttcttaaa gtcaagttca ccaattctaa cacacccaac tcctacaatc atatggaccc 1980 
agacaaactg aatggcctct ggagccgtat ttctcactta gttctgcctg tgcaacctga 2040 
aaatgccctg aaaaggggca tctgcttata agaagtgaga gagaagaaaa atcatgtccc 2100 
aagttctgag aattgttcac actttctaac cagagacaga attcagagct ctttttgaaa 2160 
gaactagttt ctttttaaag aagttaagtg cttacataaa catggaacat ataaatcatt 2220 
cattcaacct aattatctgt gatatgagag aatcatttca gtttccattg agagctctgt 2280 
taaaggtatc ttaggagtgc agattatatg cagttcctta gagaatctgt tttgattctg 2340 
ggctgagtta ttacagttat ggtatgacca gtgagagcga tttgtgtcct ttcttatgaa 2400 
ccttcctgat tttttaactt agatttctca ctaagtttcc tgagttatta gtaagattgc 2460 
tccctaaatg taaccaggga tttttccatt atgttccctt ttataccatt taggtgtgag 2520 
ttccttagct tctgcctata ggaacataag taatgaaagg tcatctaggt gtgtgtttgg 2580 
aatttttttc ttttgttttt tggaaaatgg aaagaacaag gcaaggaagg aaaattaatg 2640 
gggaagctga agggaggaaa tgttacagta atccactgag atgtagttta gtcaaacata 2700 
ggtatatgaa ctgttaatct tggtggattc cgttgggatt tgttttttac atctcctttt 2760 
tccttccttg aagaaaaatt cttggctgtc ccaaggattc ttcatgtata ttgcaaaatt 2820 
taatatattt gttcacttga gtcttaagag taggtcaggc caaggcaggt ggatcacctg 2880 
aggtcagaag ttcaagacca gcctggccaa catggtgaaa ccccctctct actaaaaata 294 0 
caaaaattag ctgggcgtgg tggtgcatgc ctgaaatccc agctacttgg gaggctgagg 3 000 
caggagaatt gcttgaaccc gggaggcgga ggttgcagta agccgagacc atgcttattg 3 060 
ccctccagtc tgggcaacaa aaatgaaact ccatctcaaa aaaaaaaaaa aaggaaagaa 312 0 
aagaaaagaa acagtcctct tacggtaatg agggtcacct gaaagtttct aggtaaaggc 3180 
agtgtatgcc ccaggaaggg ctgaggagtt ccaaatgtta ctgctgtgag ctcaggggct 3240 
cgccactgac cagtcccttg acttgcatct tgaaaaatca gaactcactt tgcggctcta 33 00 
gcttctggat gatgggcaga gcactggtca tggccacgga ggtgatggtc ttcacaccgt 33 60 
tctctgccat ctcacacaca gacgtcaggt agggatactg gtcctttgta ctgagatagg 3420 
ctgaggacat gaggtcatac gtggagctca ccaagggcag gttgaccacc cgagt caeca 348 0 
cactcggttg tggatcaact geaeggatge catttttctc cctggagaaa gaaatcgacg 3540 
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gtctgcagcg aaaggcgaag agcaggcgcg tcccgaagac gactccggtg ccatatccgt 
gcgcatnggt cggacca 

<210> 32 
<211> 1029 
<212> DNA 

<213> Homo sapiens 



3600 
3617 



<220> 

<221> misc_feature 
<223> Incyte ID No. 



2590650CB1 



<400> 32 
ggtgggccca 

ggggggcctg 

gaagccagca 
aagggagccc 
gagccaggag 
ctgttgagcc 
gccctcccag 
cgccaggatg 
aaccaagagt 
aactgggagc 
gcgaccttcc 
gagggcactg 
gctgaccacg 
gcatcctgtt 
aaatatggca 
atgatgcttc 
gatcgtcagc 
aaatcattt 



gcaagatgga 
cctgcc tgaa 
aagttgtcct 
caggtcctca 
atccagtgaa 
agggcgccac 
tcttttgtga 
gttctgtgga 
ctgaattctg 
tgcgggtaga 
gcctcctcgg 
caggggattc 
attcaagcaa 
accgatcaaa 
ttgactgggc 
gatagggcac 
caccttgcct 



tctactgtgg 
gacccaggaa 
cctgcccagt 
agggccacct 
cctgctccgg 
cttgagcggc 
catggacacc 
t ttcttccgc 
gctgggaaat 
gc tggaagac 
tgaggtagac 
cctgagcctc 
cagcaactgt 
tctcaatggt 
ctcaggccgt 
tctggcagcc 
ttgccaacca 



atcctgccct 
caccccagct 
tgtcccggag 
ggaccaccag 
tgccaggaag 
tggtaccatc 
gaggggggcg 
tc ttggtcct 
gagaatttgc 
tt taatggta 
cactaccagc 
cacagtggga 
gcagtgattg 
cgctatgcag 
ggtgtgggcc 
agtgccc tta 
cctctgcttg 



ccctgtggct 
gcccaggacc 
ctccaggaag 
gcaagatggg 
gccccagaaa 
tgtgcctacc 
gctggctggt 
cctacagagc 
accagcttac 
accgtacttt 
tggcactggg 
ggccctttac 
tccacggtgc 
tgtctgaggc 
acccctaccg 
tctctcctgt 
cctgtccaca 



tctcctgctt 
cagggaactg 
tcctggggag 
ccccaagggt 
ctgccgggag 
tgagggcagg 
gtttcagagg 
aggttttggg 
tctccagggt 
cgcccactat 
caagttctca 
cacc tatgac 
c tggtggtat 
tgccgcccac 
cagggttcgg 
acagcttccg 
tttaaaaata 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1029 



<210> 33 

<211> 1923 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 2814726CB1 



<400> 33 

aacgtttatt 

aagaaacaaa 

ctctgagtcc 

aagacctaaa 

agtcagaaat 

aggagctagc 

aaagaaaagc 

cccaattgca 

tcacaagtac 

cagcagcaaa 

agaaggtgag 

tcacaagtct 

aagaagcaac 

aaaccgaatg 

aagaaaaaga 

tccttcaaat 

ttcgggattc 

tgcagaaaga 

tgactgagca 

agctgctccg 

tcggcctatt 

cctggccctg 

tgccaggtct 

ctctagaagc 

cctcttaaaa 



ttccttggca 
gctttcaaaa 
aagcaaggaa 
tgagcctctt 
tgaggcaaag 
tagaacaagt 
ctatcgaaat 
gaggttacac 
tagagatgaa 
ggttgcctct 
age t gage tt 
gcaaaacagt 
aaggttgcaa 
ccattctcta 
attgeacaat 
gtctaggaaa 
agctgattta 
gtatgaaaag 
ggaaaagcag 
agagaaagga 
cctggctttc 
gatgeccatg 
ggecagaget 
tggcatcaca 
cagcagaagt 



gagaagctga 
gaaaatcaga 
aaaagcagtg 
gccaaagtgt 
caagaaatac 
aaacaaaaat 
caagttgagg 
atcgatactg 
ttgcttagtg 
gagegggaca 
gagcggtggc 
tttcagctta 
ggtgaactag 
aaaagggaaa 
tctcagaagc 
gaacttgaga 
aaaactcttc 
acacagactg 
tcaatcacag 
aataatcctt 
ctgttttggt 
ttggctgccc 
tctccatgag 
etcatgetgg 
acaagaatac 



tegtcgaagg 
caagagcaaa 
acgacactac 
cccttttaaa 
agcatcttcg 
gc tttgaact 
aatccactaa 
agaatctccg 
cccgagatga 
ctgacattgc 
ggaaagcagc 
gatgtcaaca 
agaagttgag 
atgttttgct 
agagtttaga 
atcaagtggg 
tcagtaaggc 
tactctcaga 
atgagctcaa 
ccatattaca 
gttteggtec 
tggttgcagt 
agegttcett 
ggacaaacag 
agctgtaggg 



gcatctaacc 
agaatctgat 
agatgeccaa 
agatgacttg 
aaaggaattg 
tcaagc tctt 
acaaatacag 
ggaggagaag 
aattttgetc 
ttctttacaa 
gtctgaatat 
gtgtgaggac 
aaaggaatgg 
atcatcagaa 
gcttaccagt 
atccttgaaa 
agaaaaccaa 
actgaagttg 
acagtgtaaa 
acccgtccca 
attgtggtag 
aacagccatc 
gagtcegtae 
aaccattttc 
teattgettt 



aaageggtag 60 
ttttcagata 120 
atggatgagc 18 0 
cagggtgeae 240 
atcgaagccc 3 00 
ttggaagaag 3 60 
gttcttcaag 420 
gacagtgaaa 480 
cttcatcaag 540 
gaagagctta 60 0 
gagaaagaaa 660 
cagcagagag 720 
aatgcattgg 780 
ctgcaacggc 84 0 
gatctcagca 900 
gaacagcatc 960 
gcaaaggatg 102 0 
aagtttgaaa 1080 
aacaacctga 114 0 
geegtattea 1200 
agaaagaaac 12 60 
gtgctgtacg 1320 
accgtcctcc 1380 
ttcctcttta 1440 
aaattatata 1500 
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aaatgtatct gtctataaag aagatataaa attgtgactt tattctactg taagcaataa 1560 

tttgcttgca atttttcatg taatttttaa attagtatgt taagattctg aatatcatgg 1620 
tggcctaaag taggcttctt ggtacaccag attatttata acattaaatt tatgagtatt 1680 
ttactctgaa ttctgatcac cagataattc atttttctga tttgataact acccaaacca 1740 

aacaaccaat acatacatgg gaagagaggc cctgtgtgct cagtgcctat cagtttgaaa 1800 

tatatacaca tatatatatt ttttacatat ttacgccatt tttacttgtt atgaaaccac 1860 

tgagctattg ggaacaagac ttagagacaa ctatttgcgt ggattttttt ttttttaagg 1920 
taa " 1923 

<210> 34 
<211> 837 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 4628933CB1 

<400> 34 

atcggttagc gccttgccat gattaatcca gagctgcggg atggcagagc tgatggcttc 60 

atacatcgga tagttcccaa gttgatacaa aactggaaga ttggccttat gtgcttcctg 120 

agtattatta ttactacagt ttgcattatt atgatagcca catggtccaa gcatgctaaa 180 

cctgtggcat gttcagggga ctggcttgga gtgagagata agtgtttcta tttttctgat 240 

gataccagaa attggacagc cagtaaaata ttttgtagtt tgcagaaagc agaacttgct 3 00 

cagattgata cacaagaaga catggaattt ttgaagaggt acgcaggaac tgatatgcac 3 60 

tggattggac taagcaggaa acaaggagat tcttggaaat ggacaaatgg caccacattc 420 

aatggttggc catcaaactc caaatggtct tgcaactgga gcctccgaca atggcttctt 480 

ctgctgggac cccttagata ggcctctgag ggagctctga ctgccgtttc cccaaaacaa 540 

tgtcccctgt cagcaggaag cagttaagat caatcttcat ccttatccca atccttaata 600 

taacggcagt tagatgtact tctttagagt ggggaatgag acagccaagt aaaaagaggt 660 

ccctggagaa actccgactg gtccgcgcac tggggtggag cctcgggagg tttacccgtt 720 

tgcagaggga ggagcttgtc ctctcccgta ccagggtggt aacctgggga ttcaatctgt 780 

gagatggggg cctgttaaca ggcacctctc tagctttact gaattttttt ccttttc 837 
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DECLARATION AND POWER OF ATTORNEY FOR 
UNITED STATES PATENT APPLICATION 

As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name, and 

I believe that I am the original, first and sole inventor (if only one name is listed below) or an 
original, first and joint inventor (if more than one name is listed below) of the subject matter which is 
claimed and for which a United States patent is sought on the invention entitled 

HUMAN MEMBRANE- ASSOCIATED PROTEINS 

the specification of which: 
/ X / is attached hereto. 

/ / was filed on as application Serial No. and if this box 

contains an X / /, was amended on . 

/X / was filed as Patent Cooperation Treaty international application No. PCT/USOO/10884 on 
April 20, 2000, if this box contains an X /_/, was amended on under Patent Cooperation Treaty Article 
19 on 2001 , and if this box contains an X /_/, was amended on . 

I hereby state that I have reviewed and understand the contents of the above-identified 
specification, including the claims, as amended by any amendment referred to above. 

I acknowledge my duty to disclose information which is material to the examination of this 
application in accordance with Title 37, Code of Federal Regulations, § 1.56(a). 

I hereby claim the benefit under Title 35, United States Code, § 1 19 or §365(a)-(b) of any 
\ foreign application(s) for patent or inventor's certificate indicated below and of any Patent Cooperation 

\ Treaty international applications(s) designating at least one country other than the United States 

indicated below and have also identified below any foreign application(s) for patent or inventor's 
certificate and Patent Cooperation Treaty international application(s) designating at least one country 
\ other than the United States for the same subject matter and having a filing date before that of the 

application for said subject matter the priority of which is claimed: 
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Counti-y 



Number 



Filing Date 



Docket No.: PF-0698 USN 

Priority Claimed 
/_/Yes //No 
//Yes //No 



I hereby claim the benefit under Title 35, United States Code, § 1 19(e) of any United States 
provisional application(s) listed below. 



Application 
Serial No. 



Filed 



60/130,694 



April 23. 1999 



60/140,580 



June 23, 1999 



Status (Pending, 
Abandoned, Patented) 

Expired 

Expired 



I hereby claim the benefit under Title 35, United States Code, § 120 of any United States 
application(s) listed below and, insofar as the subject matter of each of the claims of this application is 
not disclosed in said prior application(s) in the manner required by the first paragraph of Title 35, 
United States Code §1 12, 1 acknowledge my duty to disclose material information as defined in Title 
37 Code of Federal Regulations, § 1.56(a) which occurred between the filing date(s) of the prior 
applications) and the national or Patent Cooperation Treaty international filing date of this application: 



Application 
Serial No. 



Filed 



Status (Pending, 
Abandoned, Patented) 



I hereby appoint the following: 

Lucy J. Billings 
Michael C. Cerrone 
Diana Hamlet-Cox 
Richard C. Ekstrom 
Barrie D. Greene 
Lynn E. Murry 
Shirley A. Recipon 
Susan K. Sather 
Michelle M. Stempien 
David G. Streeter 
P. Ben Wang 



Reg. No. J36,749 
Reg. NoJ39,132 
Reg. No,, 33,3 0? 
Reg. No ^37, 027 
Reg. N o. 46,740 
Reg. No^42^1S 
Reg. No,4ZJU£ 
Reg. No^^l^ 
Reg. No. 41,327 
Reg. NQ30£g 
Reg. No.~4J^420 



respectively and individually, as my patent attorneys and/or agents, with fall power of substitution and 
revocation, to prosecute this application and to transact all business in the Patent and Trademark Office 
connected therewith. Please address all communications to: 
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LEGAL DEPARTMENT 
INCYTE GENOMICS, INC . 
3160 PORTER DRIVE, PALO ALTO, CA 94304 

TEL: 650-855-0555 FAX: 650-849-8886 or 650-845-4166 



I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these statements 
were made with the knowledge that willful false statements and the like so made are punishable by fine 
or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that such 
willful false statements may jeopardize the validity of the application or any patent issuing thereon. 



First Joint Inventor: 



Full name: 
Signature: 
Date: 

Citizenship: 
Residence: 
P.O. Address: 



Jennifer L. Hillman 




United States 



Mountain View, California 



230 Monroe Drive, #17 
Mountain View, California 94040 



Second Joint Inventor: 



Full name: 
Signature: 
Date: 

Citizenship: 
Residence: 
P.O. Address: 




Olga Bandman 



tJ £ 4^fsZtAJ>&&L„ , 2001 



United States 



Mountain View, California 



366 Anna Avenue 

Mountain View, California 94043 
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Third Joint Inventor: 




Full name: 
Signature: 
Date: 

Citizenship: 
Residence: 
P.O. Address: 



Y. Tom Tang 



. /I'D* ' 




,2001 



United States 



San Jose^ California 



4230 Ranwick Court 

San Jose, California 95118 



Fourth Joint Inventor: 




Full name: 
Signature: 
Date: 

Citizenship: 
Residence: 
P.O. Address: 



Preeti Lai 




Sej>^c^Ji^^ /o a 2001 



India 



Santa Clara, California 



P.O. Box 5 142 

Santa Clara, California 95056 



Fifth Joint Inventor: 




Full name: 
Signature: 
Date: 

Citizenship: 
Residence: 
P.O. Address: 



Henry Yue 




United States 



Sunnyvale, California 



826 Lois Avenue 
Sunnyvale. California 94087 
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Sixth Joint Inventor: 



if 



Full name: 
Signature: 
Date: 

Citizenship: 
Residence: 
P.O. Address: 
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Roopa Reddy 




2001 



India 



Sunnyvale, California 



1233 W. McKinley Avenue, #3 
Sunnyvale, California 94086 



Seventh Joint Inventor: 




Full name: 
Signature: 
Date: 

Citizenship: 
Residence: 
P.O. Address: 



Yalda Azimzai 




■£fW?^/g 7.001 



United States 



iaO^ 



Castro Valley, California* 



5518 Boulder Canyon Drive 
Castro Valley, California 94552 



Eighth Joint Inventor: 




Mariah R . Baughn 



£ -2001 

United States 



,-&aft4^andro, California 



14244 Santiago Road 

San Leandro, California 94577 
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